Jess Cox & Marta Diepenbroek 53 min

Breakthroughs in Forensic DNA Analysis Using Next-Generation Sequencing


The growing use of NGS in forensic genetic applications has resulted in the analysis of alternative markers that can complement CE-STR results for challenging samples- bones, mixtures, or degraded samples- to generate investigative leads. Marta Diepenbroek will discuss her collaborative work with colleagues from Poland and Austria, which concerns the first genetic and phylogeographic evidence for Jewish Holocaust victims found at the former Nazi death camp in Sobibór. Diepenbroek will discuss show how forensic analysis of haploid markers can shed light on historical events. Jesse Cox will describe the underpinnings of an approach to deconvolute mixtures of two individuals to provide ancestry and phenotypic information via NGS for both a major and minor contributor. In this webinar you’ll hear about: NGS analysis of the mitochondrial genome can be combined with CE-based Y STR markers to enable lineage determination of remains SNP analysis by NGS is used to deconvolute mixtures for investigative lead generation



0:00

Hello, and welcome to Breakthroughs and Forensic DNA Analysis using Next

0:04

Generation Sequencing,

0:06

brought to you by Forensic and sponsored by Thermo Fisher Scientific.

0:09

This is the fifth and final webinar in the 2021 Future Trends and Forensic DNA

0:14

Technology

0:15

Series.

0:16

My name is Michelle Taylor, Editor-in-Chief of Forensic and I will be your

0:18

moderator throughout.

0:20

For today's webinar, you can earn one hour of continuing education credit.

0:24

Following the conclusion of the webinar, you will receive an email with

0:27

information on

0:28

how to obtain CE Credit Documentation.

0:31

We have a great lineup scheduled to present to you today, but before we begin,

0:35

I'd like

0:35

to take just a moment to cover a few logistics.

0:38

At the end of the presentation, we will hold a question and answer session.

0:42

To ask a question, click on the "Ask a Question" tab in the upper right corner

0:46

of

0:46

your screen.

0:47

Please also take note that the right side of the screen features an overview of

0:50

today's

0:50

webinar as well as more information about our speakers.

0:54

If you have a technical question during today's event, click on the "Test your

0:57

Connection"

0:58

button on the bottom of your screen.

1:00

From there, you can access additional webinar support.

1:03

We also invite you to use a social media widget beneath the webinar to share

1:07

with your friends

1:07

and colleagues.

1:09

Today, you will hear from Marta Deepenbroke, a post-doctoral scholar at the

1:14

Ludwig Maximilian

1:15

University in Munich.

1:17

Marta will discuss her collaborative work with colleagues from Poland and

1:20

Austria concerning

1:21

the first genetic and phylogeographic evidence for Jewish Holocaust victims

1:26

found at the

1:26

former Nazi death camp in Sobobor.

1:30

For the past six years, Marta was a team member of the Polish genetic database

1:34

of victims

1:35

of Tolitarianisms, a DVI project with the purpose of employing forensic methods

1:40

to identify

1:41

victims of communist and Nazi regimes in Poland.

1:44

Currently, her target is the practical use of state-of-the-art forensic methods

1:49

in routine

1:50

investigations and presenting them to law enforcement.

1:54

Her work focuses on the newest MPS-based solutions.

1:57

We'll also hear from Jesse Cox, assistant professor at the University of

2:02

Nebraska Medical

2:03

Center.

2:04

Jesse will describe the underpinnings of an approach to de-combolute mixtures

2:07

of two individuals

2:09

to provide ancestry and phenotypic information via NGS for both a major and

2:14

minor contributor.

2:16

Jesse completed his AP/CP Pathology Residency and Molecular Genetic Pathology

2:21

Fellowship

2:22

training at the University of Nebraska Medical Center in Omaha.

2:26

He currently serves as director of the Human DNA Identification Laboratory and

2:30

HLA Tissue

2:31

Typing Laboratories at UNMC.

2:34

Thank you for joining us for the final webinar in the 2021 Future Trends in

2:38

Forensic DNA

2:39

Technology Series, and we hope you have enjoyed this series as much as we have.

2:44

Be sure to check your email earlier in the new year, because we will be

2:47

continuing this

2:48

series.

2:49

Now, without further ado, let's get underway.

2:52

Welcome, everyone.

2:53

My name is Matha Deepindrok.

2:55

I currently work as a product researcher in forensic genomics and the Institute

3:00

of Human

3:01

Disease at LMEU.

3:02

Today, it is my pleasure to present the results of a collaborative work I would

3:09

call it from

3:10

Poland and Austria.

3:11

And during my presentation, I will discuss how our genetic and phylogenographic

3:17

analysis

3:18

shed light on the historical events and the solvable death camp, because I

3:23

believe that

3:24

as forensic scientists for the death and the living, we must bear witness.

3:30

According to the last year's survey, conducted among young Americans, almost 70

3:36

% of them

3:36

are not aware how many Jews were killed during the Holocaust and what is more

3:42

some stated

3:43

that it is actually Jews who caused the Holocaust.

3:47

Almost 50% of the respondents were not able to name a single concentration camp

3:53

and almost

3:54

a quarter of them called the Holocaust a myth.

3:59

The Holocaust was the systematic bureaucratic state sponsored by the

4:04

persecution and murder

4:06

of 6 million Jews by the Nazi regime and its collaborators.

4:11

It was practiced by locking Jews in gaztos, by executing them in 12th ground,

4:16

by forcing

4:17

them to work in labor camps and by mass killing them in death camps.

4:24

The last were the most secret yet the deadliest form of the Holocaust.

4:28

In contrast to the better known concentration camps, like for example Auschwitz

4:33

, dead camps

4:34

served only one purpose.

4:36

They were created to carry mass scale mergers.

4:40

People who arrived at them were sent direct people gaztrenders and the

4:44

existence of these

4:45

places was kept secret even within the Nazis.

4:49

Therefore, so Ivar is a place that many never heard about.

4:53

It was built in spring 1942 as one of the three camps operating as a part of a

5:00

secretive

5:01

opposition, Reinhardt, which was a plan to kill most Polish Jews in the general

5:06

government

5:07

district.

5:08

This single mass murder campaign took lives of almost 2 million people.

5:14

In Sobibor itself almost 200,000 people were killed.

5:19

A group of resistance organized a revolt which led around 300 prisoners to

5:25

freedom.

5:26

As an aftermath the camp was closed and completely demolished, leaving no trace

5:31

of its existence.

5:34

Only 58 of those who escaped survived the world.

5:39

Due to the fact that the Nazis destroyed any evidence of Sobibor's presence,

5:44

the only

5:45

sources that could testify about the place where the aforementioned survivors

5:50

and the

5:51

guards who walked at the camp.

5:53

However, both sources were unfortunately not fully reliable.

5:57

The prisoners who escaped from Sobibor never saw with their eyes the deadliest

6:02

part of

6:02

it, so-called "laga trite" or in English "camp 3".

6:07

Namely the place where the gas chambers were located and where the bodies were

6:12

burnt and

6:12

burnt.

6:13

During the revolt, prisoners operating this part of the camp would not be

6:18

informed about

6:19

the upcoming events and therefore none of them survived the escape.

6:25

And as for the guards, the sincerity of their testimony could naturally be

6:28

caught in the

6:29

question.

6:31

This slide shows an aerial photo taken by Luftwaffe almost a year after the

6:37

camp was destroyed.

6:38

As you can see, except over a few buildings that could not be linked to any

6:43

criminal activities,

6:45

the place was torn to the ground.

6:47

Based on the available testimonies, the potential layout of the camp was

6:52

reconstructed.

6:53

However, new evidence still keeps coming to the light.

6:57

Just last year, some private photos from one of the Nazi guards working at the

7:03

camp were

7:03

found hidden at the attic.

7:05

The pictures were taken mostly during everyday activities and they show only

7:10

the Nazi residential

7:12

buildings, but even there, the historians were able to spot important evidence.

7:17

Like for example, here in the background we can see an excavator which was used

7:23

to dig

7:24

out the already buried decaying bodies in order to burn them as it was required

7:31

by their new

7:31

regulations.

7:33

Therefore, in order to learn more about the camp's layout, archaeological work

7:38

was carried

7:39

out at the place, starting in the year 2000.

7:42

It led to the discovery of mass graves with cork wax and human ashes and

7:48

archaeological

7:49

signs with the potential relics of the gas chambers.

7:53

The excavations continued as non-invasive workers and later in 2007, systematic

8:00

excavations

8:01

began which led to the discovery of the lagatite relics.

8:06

Due to a planned commemoration of the victims of the camp, further work at the

8:12

camp 3 was

8:12

carried out.

8:13

And this led to the discovery of foundations that undoubtedly belonged to the

8:19

former gas

8:19

chambers.

8:20

However, surprisingly, also intact human skeletons were revealed.

8:27

Here is a picture taken by the archaeologists working in the camp showing the

8:34

view on the

8:35

lagat and rice prepared for the commemoration of the victims.

8:42

And here are all the mass graves previously located at the camp 3.

8:48

However, unexpected skeletal burials were also found on the outskirts.

8:54

And expected as according to the available testimonies, all the Jewish victims

8:59

killed

8:59

at the camp were cremated.

9:03

The pictures showed two of the four skeletal graves recovered at the camp.

9:10

On the left, on the picture seen on the right is present at Vojik Manzurak, an

9:15

archaeologist

9:16

who led the excavations and with whom we collaborated during this project.

9:21

Later with his colleagues, when analyzing the discovery, they observed that the

9:26

layers

9:27

of the burials were crossing the camp ground players, which suggested that the

9:32

victims were

9:33

buried after the end of the criminal activity of the camp.

9:37

Additionally, the historians working on the case found a report on a criminal

9:43

activity

9:44

in the camp already after the war.

9:47

The remains of the villages in the area witnessed shops which were connected to

9:52

the crimes committed

9:53

by communists urging.

9:56

Therefore, it was assumed that the remains belonged to Polish partisans, killed

10:02

secretly

10:02

in the 1950s by communists.

10:05

The investigation was taken over by the Institute of National Remembrance and

10:10

based on their

10:11

decision, the remains recollected and sent to the Polish genetic database of

10:15

totalitarian

10:16

victims, a humanitarian project led by the stretching and forensic institute.

10:23

According to the procedures used in the project, first the remains were

10:27

submitted to anthropological

10:28

and medicalical assessments.

10:31

All 10 excavated skeletons were attributed to men who were estimated to be

10:37

between 20

10:38

and 60 years old.

10:40

Our scars showed traces of injuries caused by a firearm and once teleported on

10:46

the

10:46

plate signs of injuries caused by a gunshot to the chest.

10:50

All entry ones had a diameter of between 7 and 8 millimeter.

10:56

The localization of the entry ones pointed to a systematic way of execution of

11:01

the victims.

11:03

Shorts were aimed to the back of the head or near the neck and this pattern

11:08

indicated

11:09

that the bullets were fired from a shot weapon to a victim lying down or

11:15

kneeling with the

11:17

head strongly inclined downwards.

11:20

Altogether, more than 100 other parts were recovered during the field work in

11:27

Soviet

11:27

work.

11:28

Most of these included pieces of metal and fabric, which did not carry any

11:34

specific information

11:35

about the victims, however, some objects appeared to be some personal

11:40

belongings like, for

11:42

example, a pocket knife or a leather shoe.

11:46

And three of the artifacts found during the archaeological work were submitted

11:52

to ballistic

11:52

analysis.

11:53

Those were to grasp pistol shells and one iron made pistol projectile.

12:00

Two shells were assigned to a German company producing weapons during the

12:06

Second World

12:07

War and both shells were of caliber 7.65 millimeter and could be assigned the

12:15

brownie

12:16

system, which was difficult for self-loading pistols.

12:21

These hand guns were standard equipment of German guards in concentration and

12:26

death guns.

12:27

However, they were also used by the partisans after the war.

12:33

Bon samples were submitted to their lab workflow used in the project, which

12:38

after the bon preparation

12:40

focused on the golden standard of forensic DNA typing done with capillary elect

12:46

rophoresis.

12:46

This included also haploid marker analysis, NDB1 and NDB2 of the micro-hondrel

12:52

DNA were

12:53

sequenced with sandal and because of a high degradation of the samples, hapl

12:57

otypes were

12:58

obtained for half of the individuals.

13:00

But rather these results look interesting as three of the haplotypes were

13:06

assigned by

13:07

m-pop to the K haploid group, which is not typically expected in Poland,

13:12

especially not

13:13

the particular K1A damage.

13:15

This obviously literature search, which pointed at the K1A limit, is one of the

13:22

four most common

13:23

micro-hondrel haploid groups found among modern Ashken algorithms.

13:30

Based on those findings, the white chromosome on the data was also analyzed

13:34

from phylogenetic

13:35

perspective.

13:37

We started with only heads, namely, YSCR profiles, and we used them to perform

13:42

haploid

13:42

group estimation using a Neptune predictor.

13:46

Again, for eight out of 10 individuals, the predicted haploid groups are not

13:52

typically

13:53

found in Poland.

13:54

In fact, they are rather infrequent among all Europeans and found frequent big

14:00

among the

14:00

Ashkenazi, especially the J1-h

14:10

among Ashkenazi.

14:13

And looking at the haplotypes predicted as J1A, we observed that they are

14:18

almost identical

14:19

for all four individuals.

14:21

An observation concerning this particular similarity was done by other

14:25

scientists already in the

14:27

late '90s, when they studied genetics of Jewish priests.

14:34

The group of interests were Jewish Kohanim, who are believed to be of direct

14:35

patriena

14:41

are destined from the De Bicah Aron.

14:44

Study of their white chromosome revealed that the most frequent haploid group

14:49

observed among

14:50

the group is J3-58, and that almost all of them show even the same haplotype

14:57

called later

14:57

their co-hen model haploid types.

15:01

Using all of these preliminary observations left to our decision, that our

15:05

analysis should

15:06

be expanded.

15:07

For this matter, we established a collaboration with a group from INSCROC led

15:12

by Professor

15:13

Viter Tarzinn.

15:14

We plan to extend the mitochondrial analysis with performing the whole

15:19

mitochondrial genome

15:20

sequencing using the next generation sequencing.

15:24

And for the white chromosome, we decided to design primers for the white SNPs

15:28

suggested

15:29

by Nefter and to analyze them with Sanga sequences.

15:35

And the whole mitochondrial genome sequencing was done using their precision ID

15:39

mitochondrial

15:40

DNA whole genome panel and the ion-s system.

15:44

We used multiple extracts for each individual, as they were all highly bed-

15:49

related, with

15:50

some showing less than 100 micro copies per microliter.

15:55

Nevertheless, we were successful and we obtained whole microgenomes for all 10

16:01

individuals,

16:02

which were assigned to AMPOP to eight different haplogroups.

16:07

Since the largest published, Ashkenazi mitochondrial DNA datasets are based on

16:12

controlled region

16:13

sequences, we used those to estimate relative frequencies of their haplogroups

16:19

observed

16:20

among Ashkenazi groups of different European origin and we contrasted them to

16:26

the distribution

16:27

of the same haplogroups among corresponding European populations.

16:32

The data showed that all of haplogroups established for the study individuals

16:38

and corresponded

16:39

to a lineage found in Ashkenazi while being rare or missing in other European

16:45

populations.

16:46

To do a more specific search, we queried the obtained control region haplotypes

16:52

against

16:53

AMPOP database, which we called almost 40,000 worldwide control region

16:59

sequences.

17:00

And the comparative search was also performed for the microgenome data querying

17:05

a dataset

17:06

of almost 30,000 worldwide microgenomes, downloaded from gene bank.

17:11

All individuals got matches, which consisted of or at least included samples of

17:18

non-Ashkenazi

17:19

origins.

17:20

Since the original haploteases assume that the remains belong to Polish

17:25

partisans, further

17:27

phylogenographic analysis were performed on a selected set of Polish microtypes

17:32

and on

17:33

available Ashkenazi microgenomes.

17:36

For each haplogroup observed among the study individuals, we prepared a

17:41

separate dataset

17:43

which consisted of haplotypes and interests, their Polish and the Ashkenazi

17:49

microgenomes

17:50

and we performed multiple sequence alignments in the DIGS.

17:54

This analysis revealed that four of the established haplogroups clustered with

18:00

known Ashkenazi

18:01

images and the mitochondrial DNA results of finding the study had special

18:06

meaning since

18:07

in Judaism, materialina and descent is of a special importance.

18:13

As for the paternal lineage, while the Sunday is deciding on which SNP sequence

18:18

we looked

18:19

at both national predictions and at the markers mentioned by literature.

18:25

As the data is evolving and the phylogenetic number structure is changing.

18:30

The standard sequencing revealed and derived are used at all the analyzed signs

18:37

thereby

18:37

confirming that most of the lineages observed among the study individuals are

18:43

infrequent

18:44

among Europeans, especially the J358 lineage.

18:49

Two white chromosomes were assigned to haplogroup R, which is frequently

18:53

observed in both

18:55

Ashkenazi and other European populations.

18:57

However, studies show that a particular L1A lineage, namely M at 5H2 that we

19:05

observed

19:05

now study, is extremely rare in non-Jewish populations.

19:11

As it was confirmed that four individuals assigned to J

19:12

haplogroup belong to their J58 lineage, we continued our work concerning the C

19:22

auhan

19:22

model haplotype.

19:23

The first published haplotype was based on only six YSTR markers, presented

19:29

here on the

19:30

slide, and our STR data included five of these markers, and all the observed

19:37

values were fully

19:38

recorded with the Cauhan model haplotype.

19:41

A later study in the topic resulted in extending the markers set to 12 YSTRs,

19:49

and when comparing

19:50

our data, we observed that two of four individuals still show a match with the

19:57

haplotype and interest.

19:59

But in order to perform a full comparison, we were missing a Lele data for two

20:06

YSTRs,

20:07

which are not included in any modern commercial forensic technique.

20:12

And therefore, we sequenced the markers in interest with Sangha, and as a

20:19

result, we obtained

20:20

a full match with the extended Cauhan model haplotype for two of the study

20:25

individuals,

20:26

and the remaining ones were close neighbors.

20:30

Our genetic and phylogenographic analysis were submitted to the prosecutor's

20:35

office from

20:36

the Institute of National Remembrance, who led them in the investigation

20:41

concerning the

20:42

remains.

20:43

Based on that, the responsible authorities ordered every burial of the remains.

20:49

As seen on the photo, the ceremony was led by a rabbi, and it was held

20:55

following George

20:56

Wright.

20:57

The victims were buried in separate graves at the place of their discovery.

21:04

Earlier this year, the results of this collaborative interdisciplinary study,

21:09

including specialists

21:10

from history, archaeology, anthropology, statistics, and parenthesis, were

21:15

published in genome

21:16

biology.

21:17

Since then, we continuously endeavored to promote our findings, as we agreed

21:23

with one of the

21:24

sub-uberance survivors' sets, education is the key to a better world.

21:30

I had an honor to present the results of the work of many great specialists

21:34

from different

21:35

fields, and I would like to thank all of them for collaborating on this project

21:42

Thank you for your attention.

21:50

I am happy to answer any questions.

21:53

Hello.

21:54

My name is Jesse Conchsen.

21:55

I am the director of the Human DNA Identification Laboratory at the University

21:58

of Nebraska Medical

21:59

Center in Omaha, Nebraska.

22:01

Today I'm going to discuss our recent development and work towards using next-

22:05

generation sequencing

22:06

technology to analyze a two-person mixture of ancestry and phenotype polymorph

22:11

isms, and

22:11

their decombulation to provide investigative leads.

22:16

I have no competing financial disclosures to report.

22:22

In this talk, I will discuss the reasoning for addressing this problem, the

22:26

theory and

22:26

logic behind our approach and the development of our corresponding tool, the

22:30

process we

22:31

used in validation, and lastly, how we can use this tool in order to assign

22:35

ancestry and

22:36

phenotypes to two individuals that have contributed to a single specimen.

22:43

The central question our laboratory has had was whether we could use ancestry

22:47

and phenotype

22:47

testing to help investigators bolster or refute potential investigative leads.

22:53

Using this information, could we provide our partners with potential physical

22:56

characteristics

22:57

that a person of interest may have?

23:03

Welcome to this question, and underpinning biochemistry and molecular biology

23:07

is the principle that

23:07

DNA is transcribed into RNA that is then translated into proteins.

23:12

Further, this information encoded in the standard nucleotides is both

23:16

simultaneously unique to

23:18

an individual but is passed from generation to generation.

23:24

As we are all aware, this genetic material is progressively packaged within

23:28

each nucleated

23:29

cell and inherited from mother and father.

23:34

Speaking toward the single nucleotide polymorphism and other genetic

23:37

alterations that are inherited

23:39

and at the same time make us unique, certain SNPs have been identified in which

23:43

particular

23:44

alleles are more commonly found in individuals of certain ethnicities.

23:48

In the pictured reference SNP example, the reference allele C is seen

23:53

approximately 99%

23:56

of persons of Asian descent, whereas nucleotide C is seen only about 4% of the

24:02

time in persons

24:04

of European descent.

24:05

Thus, if a C is seen at this particular SNP, the individual will be much more

24:10

likely to

24:11

be of Asian descent as compared to European descent.

24:18

With this principle in mind, assessment of SNPs across the genome with varying

24:22

frequency

24:23

and peoples of differing geographic origins may be used to imply where a

24:27

particular person

24:28

may have descended from.

24:30

In fact, this type of approach is how many of the ancestry services go about

24:35

providing

24:36

detailed ancestry data to their patrons.

24:43

To bring this type of testing in-house, we assessed at the time the various NGS

24:47

platforms

24:47

available to determine ancestry.

24:50

We identified the ion torrent S5 platform as being suitable for our purposes,

24:55

running

24:55

the Precision ID ancestry panel.

24:58

Our human DNA identification laboratory works alongside our molecular diagn

25:02

ostics laboratory

25:03

at UNMC, which was using ion torrent technology as well, giving us some

25:09

familiarity with the

25:10

strengths and weaknesses of the platform.

25:13

Further, the ion chef, Lipid Handeler, enabled West Hands-On Time for our techn

25:18

ologists so

25:18

we could better receive our workflows.

25:21

The panel we chose to assess was the Precision ID ancestry panel based off

25:25

ancestry-focused

25:27

SNPs identified and characterized by doctors' kid and selden.

25:34

From this panel, the broad regions identified and ancestry testing included

25:39

those in the

25:40

figure on screen, including America, Europe, North Africa, Middle East, Sub-

25:46

Saharan Africa,

25:47

South Asia, East Asia, as well as Oceana.

25:52

These data would then allow us to intentionally interpret the hands-sustery of

25:56

an individual

25:57

from the geographically distinct areas.

26:02

As discussed earlier, genes encoded in genomic DNA is transcribed into RNA and

26:07

subsequently

26:07

translated into protein, to building blocks that give rise to our physiological

26:12

and physical

26:12

characteristics.

26:14

To respect the question, could certain SNPs be identified which could predict

26:18

certain

26:18

traits, such as hair color and eye color?

26:23

And the obvious answer to the prior question is an unequivocal yes, and the

26:26

tools are freely

26:27

available online to perform such predictions.

26:31

Screen captures from one such tool, the Kaiser 6 blocks are shown on the screen

26:35

now.

26:36

Using six different SNPs, eye color, blue versus brown versus a mixture of the

26:40

two, can

26:41

be predicted dependent on the genotype input into the tool.

26:48

Broadening this notion out, as applicable to the platform we chose for our NGS

26:52

testing,

26:52

we identified and validated the ion, amplicity, DNA phenotyping panel to

26:56

provide phenotype

26:58

information in addition to our ancestry panel.

27:01

Based upon the work out of the Kaiser and Walsh Laboratories, the set of 17 SNP

27:06

targets

27:07

can be used to predict eye color, hair color and hair tone.

27:14

To validate NGS for use in our laboratory, 83 samples were tested over the

27:17

course of approximately

27:19

2,000 runs.

27:20

As part of our validation, we included members of our institution with varying

27:24

ethnicities,

27:25

reference materials, as well as various specimen sources.

27:30

These data were then analyzed to assess for precision and accuracy, reproduc

27:34

ibility, as

27:35

well as sensitivity.

27:37

We were also very interested in assessing whether we could deconvolute mixtures

27:41

as we

27:41

oftentimes encounter samples with more than one contributor.

27:46

To specifically assess precision and accuracy, five individuals were run

27:52

multiple times.

27:54

Resulting genotypes were at worst 97% identical between the individual runs.

28:00

Phenotypes of the ancestry is found to be 96.8% concordant with self-reported

28:05

ancestry,

28:06

with the most pronounced deviation occurring in those of Hispanic descent.

28:11

Phenotype was concordant in 93% of runs.

28:17

We next assessed sensitivity of the assay through the use of serial dilution of

28:22

input

28:22

material.

28:23

We were able to determine the optimal DNA input range to be in the 0.2-2 nan

28:28

ogram range.

28:29

Further, we assessed whether no template controls could be run to both examine

28:33

for contamination

28:34

as part of our standard workflows, as well as to help optimize our overall

28:39

workflow in

28:40

the laboratory.

28:42

Because the sequencing used, because the chips which are used for sequencing

28:46

are designed

28:47

to accommodate multiple individual samples, a balance between batching samples,

28:54

as well

28:55

as to accommodate reasonable turnaround times was an important consideration

29:00

for us.

29:00

We were pleased to determine that no template controls or blanks could be run

29:04

without affecting

29:05

overall results on a partially filled chip, which can therefore bolster both

29:10

our workflows

29:11

as well as our turnaround time.

29:15

We also wanted to confirm that a variety of specimen sources would be suitable

29:19

for this

29:19

type of testing.

29:21

Because of this, we assessed standard samples such as buckle swabs, blood, as

29:25

well as hair

29:26

roots.

29:27

We were also able to confirm that other sources would work as well, including

29:31

saliva, semen,

29:32

degraded tissues, as well as touch DNA.

29:37

These data gave us confidence that this platform would work for sources

29:40

commonly encountered

29:41

in the molecular forensics laboratory.

29:46

The last major area we were very interested in assessing and validating

29:49

concerned the

29:50

assessment of ancestry from mixtures of two individuals.

29:54

At the very least, we were interested in identifying ancestry of a major

29:58

contributor

29:58

with identification of the minor contributor being a bonus.

30:03

During our validation, we prepared mixtures of individuals at different ratios

30:08

between

30:08

one to one down to ten to one.

30:11

Using the assay supplied HID SNP genotype route, we could generally assign

30:16

ancestry to a major

30:17

contributor, especially at ratios between three to one and ten to one.

30:22

Importantly, however, the minor contributor was never able to be determined.

30:28

To illustrate why assignment of a minor contributor could not occur, we can

30:32

list how a reference

30:33

in alternate fields should be observed given various scenarios concerning the

30:37

homozygous

30:38

heterozygous status, both a major and minor contributor.

30:42

As you can see in the table to the right, we can, depending on the scenario,

30:48

determine

30:49

whether or not an individual will have either 50% between reference and

30:54

alternate if their

30:55

heterozygous or 100% for reference or 100% for alternate depending on the

31:02

status.

31:03

Now if we expand this table, and we take this scenario list and expand it, let

31:08

's suppose

31:08

in ratio of a major and minor, in this case, say an 80 to 20 ratio, we can

31:14

develop predicted

31:15

varying allele frequencies for a reference and an alternate allele to occur

31:20

given a particular

31:21

scenario.

31:22

So for example, in this 80 to 20 mixture of major and minor, where the minor

31:26

contributor

31:27

is heterozygous and the minor contributor is homozygous for the reference, we

31:32

would predict

31:33

that the varying allele frequency of the reference allele should be

31:37

approximately 60%.

31:38

We play a similar game, and we assume that the major is homozygous for the

31:42

alternate

31:43

allele, whereas the minor is homozygous for the reference allele, we would

31:49

expect the

31:50

reference allele to be present at 20% at the time.

31:54

Taking a step in the other direction, we can use the observed variant of the

31:57

allele frequency

31:58

of a reference allele determined by the equations in the rightmost column where

32:02

R equals percentage

32:03

of major to predict which scenario was in place with regards to the genotype of

32:08

the major

32:09

and minor contributor.

32:13

Using these predictions, we developed a tool that given a percentage of major

32:17

to minor

32:18

can ingest coverage data produced on our S5 and output corresponding genotypes

32:23

for a

32:23

major contributor and the minor contributor.

32:26

In the screen captures presented, a 5 to 1 ratio is analyzed, and we were able

32:30

to determine

32:31

a number of genotypes which differ between the major and the minor.

32:37

To verify our genotype deconvolution tool could reasonably perform and compare

32:43

deconvolution

32:44

results of major and minor to genotype results, and the contributors were run

32:48

as our gold

32:49

standard of a single source specimen.

32:52

We looked at the accuracy of ratios ranging from 2 to 1 out to 10 to 1.

32:57

Results are graphically represented on the left with the top line representing

33:01

the major

33:02

and the lower representing minor contributors.

33:05

The table on the right shows the number of correct alleles with the maximum

33:09

correct being

33:10

100 and maybe 7.

33:12

Also presented as the average number of alleles which were collected and I

33:15

correctly call

33:16

them along with standard deviation.

33:19

Lastly, the percentages reflect the number of alleles with a denominator of 107

33:26

Overall the decongolition tool performs best at ratios between 4 to 1 and 9 to

33:31

1 with up

33:32

to 92.5% of the minor genotype being correctly assigned.

33:37

With these data, we next want to determine whether we could ascertain useful

33:41

phenotype

33:41

and ancestry interpretations from the decongulated data.

33:46

To do so, we developed tables in order to ingest our genotype data to assign an

33:55

ancestry.

33:55

Our first goal was to recreate the Bioinformatics tools provided in the HID SNP

34:00

genotype route.

34:01

At the same time, we sought to further delineate some of the larger geographic

34:05

regions such

34:06

as the America regions in order to give information which was a little more

34:12

granular.

34:13

To do this, we collected SNP frequency data for different geographic regions

34:16

from the

34:17

alpha database, the HGDP database, the page database and from Alfred.

34:23

The frequency tables from these databases were then used in conjunction with

34:27

the Hardy-Wine

34:27

group formulas to assess likelihood of a given SNP occurring in a person

34:32

descended from a

34:33

particular region.

34:35

Snips were then combined using the product rule and the geographic regions were

34:39

ranked

34:39

as displayed on the screen now.

34:44

Overall, the results of these analyses are then presented in the following form

34:48

with

34:49

the results using each database presented individually.

34:52

Dependent on the results, some databases are excluded.

34:56

For instance, assignment of a person under the European umbrella will lead to

35:03

the exclusion

35:04

of results from the page database as the page database does not have to be

35:09

represented.

35:10

If these data are presented most of the time, people of European descent will

35:14

be presented

35:15

as being of human descent.

35:18

Databases are then combined into an overall summary panel which includes 11

35:21

geographic

35:22

regions again to increase the granularity over their original genotype results.

35:27

In particular, the original America group has been expanded into Latin America

35:31

East which

35:31

includes places like the Caribbean, Latin America West which includes places

35:36

like Mexico, North

35:37

America as well as South America.

35:40

Finally, results are presented in a radar plot to better help the reader

35:45

visualize the output

35:47

for analysis.

35:48

In the example presented on screen, the major contributor is a Latin America

35:55

West descent

35:56

whereas the minor contributor is a European descent.

36:02

To assess how the ancestry calling performed from our tool, we compared the

36:06

results using

36:08

the former HID SNP genotype tool to our developed tool and found no significant

36:14

differences

36:15

when looking at single source specimens and those data are not shown.

36:19

We then assessed how to mix your deconvolution compared with assignment using

36:23

single source

36:24

specimen and self-reporting and found that the major was conchordant

36:28

approximately 91%

36:29

of the time with the minor contributor being conchordant approximately 50% of

36:37

the time.

36:38

Because we plan to use this tool to help with investigative leads, we looked at

36:42

whether

36:42

our misses were close geographically as people's of neighboring geographic

36:47

regions may share

36:48

physical characteristics.

36:51

So for example, if we incorrectly called somebody as being from Latin America

36:56

West when they

36:57

should have in fact been North American or vice versa.

37:01

Because we planned when loosening our astringency, accuracy increased 96% per

37:07

major and 82% for

37:09

the minor and also keep in mind this includes all data examined at ratios from

37:14

3.1 out to

37:15

10.

37:16

And especially at that 10.1 ratio, our data did not perform.

37:19

Our deconvolution tool did not perform very well.

37:24

We next wanted to look more specifically at ratios where the genotype deconv

37:27

olution tool

37:28

performed best.

37:29

Our accuracy rose substantially at the 6.1 level where we saw 100% concordance

37:34

in the major

37:35

ancestry and 83% in the minor.

37:38

Allowing again for our less stringent interpretation, accuracy was up to 100%

37:43

at the 91 and 61 levels

37:45

for both major and minor contributors.

37:52

In keeping with assessing for phenotype in addition to ancestry, we also

37:57

adopted informatics

37:59

tools to assign eye and hair color for both our major and minor contributor and

38:03

the results

38:03

into output performed similarly as our ancestry tool.

38:10

So overall, the tool we have put together to deconvolute mixtures of two

38:13

individuals

38:13

to provide ancestry and phenotype data for both a major and minor looks like

38:17

the image

38:18

on the screen.

38:19

As we've worked with this tool during its development, we have identified a

38:22

number of

38:23

rules and caveats for interpretation, but overall, we think that this will be

38:27

useful

38:27

for our investigative partners.

38:30

Specific rules, for example, include identifying a major contributor of African

38:34

descent, which

38:35

may skew results for a minor contributor.

38:41

In summary, next generation sequencing allows for the assessment of many

38:45

individual genomic

38:45

locations and assessment of SNP status of these locations and can be used to

38:51

assign ancestry

38:52

and phenotype information.

38:55

Using variant allele frequency of a reference allele allows us to predict which

38:59

genotype

38:59

scenario is occurring between a major and minor contributor and applying this

39:03

over many

39:04

SNPs can help us to assign corresponding interpretive data for both ancestry

39:09

and phenotype.

39:10

Lastly, we believe that this will be a valuable tool for our investigative

39:16

partners.

39:17

And with that, I'd like to thank you for your time and attention.

39:21

I would be happy to answer any questions you may have.

39:24

A big thank you to Martha and Jessie for those incredibly interesting

39:29

presentations.

39:30

If you have a question for either of them or even both of them, now would be

39:34

the time

39:34

to submit it through the Q&A panel that you see on your screen.

39:39

And while we wait for everyone to submit their questions, why don't we do a few

39:43

polling questions

39:44

really quickly.

39:46

Our first polling question is, does your agency utilize Y chromosome analysis?

39:51

Yes, YSTRs for sexual assault, YSTRs for homicide cases, YSTRs for mixture

39:59

analysis,

40:00

YSTRs for kinship familial analysis, Y screen for sample screening, or no?

40:07

Go ahead and get your answer in there.

40:11

The largest percentage is no, but then followed by yes, YSTRs for sexual

40:17

assault cases.

40:18

So that's really interesting.

40:20

All right, let's go to our next question.

40:24

Does your agency utilize NGS next-generation sequencing analysis?

40:29

Yes, for MTDNA analysis.

40:32

Yes, for SMP analysis.

40:34

Yes, for STR analysis.

40:37

No, but we do plan to do so in the future or just no.

40:41

Let's see what everyone said.

40:45

That's awesome.

40:46

It's no, but you do plan to do so in the future, which is really interesting

40:49

because

40:50

as we know, NGS is really taking off.

40:52

So that's interesting.

40:54

All right, our last one before we get into Q&A, would you like to receive more

41:01

information

41:02

on today's webinar topics?

41:04

Whether that be about YSTRs analysis, MTDNA analysis, using NGS or SMP analysis

41:10

using

41:11

NGS.

41:12

All right, thank you everybody so much for answering those questions.

41:18

Now we will get to the fun part, the attendee questions.

41:23

So Jesse, Martha, you guys still with us?

41:26

I think so.

41:28

Awesome.

41:29

All right, Jesse then, we will give you your first question.

41:34

Jesse, can you elaborate how you subdivided the Americas and how that made for

41:39

better

41:40

predictions in your research?

41:41

Yes, so when we were going through initially kind of setting up our frequency

41:46

tables and

41:47

whatnot, the first thing that we were trying to do was more or less recreate

41:51

what the

41:52

Bioinformatics platform we had already been using for the process of going

41:56

through that

41:57

whole de-contribution process.

41:59

Since we had the opportunity to kind of expand and look at some of these other

42:04

databases,

42:05

many databases especially like Alfred, for example, have a lot of granularity

42:09

when it

42:10

comes to the different ethnicities and peoples that they've done sequencing for

42:16

And so to kind of address some of the shortcomings, especially those folks

42:20

which were of Latin

42:21

American descent, we kind of took advantage of some of that granularity to help

42:26

kind of

42:27

generate and build out some of these more subdivided groups especially over in

42:31

the Americas.

42:32

So hopefully that's helpful.

42:34

Absolutely.

42:35

Now, how would you adapt this to other panels?

42:40

And so for other panels, a lot of it is just determining which SNPs are

42:44

interrogated and

42:45

looked at there.

42:46

Dependent upon which SNPs are there, what you'd have to go to is identify the

42:50

frequency

42:51

tables for those particular SNPs.

42:53

So in order to adapt the current tool that we have right now to say a new up-to

42:58

-date

42:58

or a different platform or something, there would be a lot of kind of heavy

43:02

lifting towards

43:04

addressing that and kind of getting to that.

43:06

But it's definitely doable, it would just take some time.

43:09

Okay.

43:10

Now another one if you, Jesse, Marta, before we move on to a few questions for

43:14

you.

43:14

So Jesse, how do you focus on the Y SNPs for a certain contributor, whether

43:19

that's major

43:20

or minor?

43:21

Do you have to perform STR analysis first and then focus on the certain alleles

43:26

from that

43:26

contributor?

43:28

So the way that we're envisioning that this workflow will work is we're going

43:32

to do traditional

43:33

STR analysis to establish a proportion of major to minor.

43:40

We're not currently using YSTRs or Y SNPs to kind of delineate between major

43:44

versus minor.

43:45

So it would just be more looking at those autosomal STR contributions and what

43:51

not to

43:51

establish that we have a two-person mixture and then from there what is the

43:54

ratio between

43:55

the major and the minor.

43:56

So hopefully that's helpful.

43:58

Absolutely.

43:59

Thank you.

44:00

I'm sure we'll come back to you.

44:02

But Marta, we want to ask you a few questions that we have from the audience.

44:06

The first one is, can you tell us in your study what were the biggest technical

44:12

issues

44:12

that you faced?

44:14

The biggest technical issue was actually with the bone material itself.

44:19

In a project we collected almost 800 bone samples and the ones collected in

44:23

probably

44:24

Boudoir.

44:25

One of the most liquid databases we had to work with, we implemented many

44:30

different DNA

44:31

extractions methods.

44:33

So we started with PrEPFiler BTA, then we continued with the organic funnel

44:38

chloroform

44:39

and still we were not happy with the results as you saw, the first results from

44:40

Sanga

44:46

were definitely incomplete.

44:48

So the next step was introducing the in-house methods from in-group for the DNA

44:54

extraction

44:55

that was lately published and luckily we had enough extract for each individual

45:02

to continue

45:03

with their next generation sequencing.

45:06

And we got home microgenomes and the Y chromosome data, but the SDR, the Y SDR

45:14

sequencing had

45:15

to be repeated many times and the mitochondrial and DNA was also extremely

45:20

degradation.

45:21

So it was a big technical challenge to even get the satisfying results.

45:26

Okay.

45:27

Can you comment and talk to us about how adding whole genome MIDO analysis by N

45:33

GS actually

45:34

aided in the identification that you were able to make?

45:38

Yes.

45:39

So without my Tohonjua, the homo-tohonjua genome, we would not get phylogenetic

45:44

resolution

45:45

that we wanted.

45:47

So the slides of Sanga showed that for three individuals, we got the prediction

45:51

of K-HAPLO

45:53

group, even though for two of them, the control regions of HV1 and HV2 allowed

45:59

only to predict

46:00

the HAPLO group SK, which was definitely not informative enough.

46:04

So thanks to the whole genome sequencing, we got a higher phylogenetic

46:08

resolution, first

46:09

of all, and second of all, when we were comparing the homo-tohonjua genome

46:15

sequences with the

46:17

reference data from Polish and Ashkenazi, Michael Genome, we definitely got

46:25

also a better

46:26

understanding of where the results fall in phylogenetically.

46:30

So we were able to cluster the micro-hondyl genome, which was not able only

46:35

based on the

46:36

control-avision sequence.

46:38

Okay, gotcha.

46:39

Now, what was the biggest challenge of publishing your results?

46:45

I think that the biggest challenge was how complex our study is, and even

46:52

though we, like

46:53

the major focus was on DNA, obviously, and I'm a forensic scientist, and all of

46:59

the results

47:01

come from a forensic scientist, we could not ignore how complex the history

47:07

behind our

47:10

findings.

47:11

So when writing the paper, I had to go very deep into the history of everything

47:17

, and we

47:17

had to work with archeologists and anthropologists, because we could not

47:22

present the results without

47:24

exploring the history of the findings.

47:28

We also wanted people to understand how important the results are and how big

47:34

the discovery

47:35

is, and this would not be possible without putting all of the historical

47:39

background to

47:40

it.

47:41

And this is something we just have a very typical forensics.

47:44

So introducing so much of history and archeology into our study.

47:51

Gotcha.

47:52

Okay, and that sounds like a fascinating article.

47:55

Can you remind the audience again what's the name of it is, and it's published

47:58

in genome

47:59

biology, is that correct?

48:01

Yes, it's published in genome biology in August this year.

48:04

It's genetic and phylogenetic evidence for Jewish Holocaust victims, a found in

48:09

Saudi

48:09

border, that can, so I hope that the paper includes everything I said and

48:16

beyond.

48:17

So I think it's great if people can really learn about the story, because I

48:25

think that

48:26

this is a part of history that many really don't know much about, and I think

48:31

that our

48:32

paper just tries to shed some light on it.

48:36

Absolutely.

48:37

And I have one more question for you before we throw it back to Jessie to round

48:39

this up.

48:41

You know, what's next for you?

48:42

Are there current efforts to identify the remains?

48:45

Are you yourself planning any more humanitarian projects concerning the remains

48:49

of the Second

48:50

World War?

48:51

This would be of course an outstanding achievement to be able to identify

48:57

people.

48:58

And this is something that of course we hope for you.

49:01

We don't know yet exactly if this would be possible.

49:05

Right now when we have the phylogenetic data and we know that the people of

49:11

Jewish origin,

49:13

we have to consider who the victims could be.

49:16

And we already know that there were 200,000 people killed in Saudi border, but

49:23

prisoners

49:24

who actually destroyed the camp were the prisoners from the other camps, which

49:28

was Treblin, our

49:29

800,000 people were killed.

49:31

So we are looking potentially at 1 million victims.

49:36

So I know that there are amazing tools in the forensics right now, in the

49:41

foreign industry,

49:42

geology, we have all of the extended analysis.

49:44

So I hope that one day this study moves forward because giving names back to

49:48

those victims

49:49

would be just something absolutely great.

49:53

And since this project concerns my previous workplace and my previous team, I

50:00

right now

50:01

moves to Munich and I work at the Institute of the Kalmani-sen in Germany.

50:06

And even though I focused more on implementing NGS into routine investigation,

50:12

I never stopped

50:13

thinking and talking about bonds.

50:15

So everyone who knows me also hear about this from me almost daily because this

50:20

is just

50:20

my greatest session in forensics.

50:22

And for a while already I'm working on another humanitarian project that was

50:27

starting my

50:28

institute with other colleagues joining in that was introduced in the

50:34

scientific identification

50:35

of the foreign German soldiers from the second forward.

50:39

Oh wow, that's interesting.

50:41

We'll definitely have to look out for that.

50:43

Thanks, Marta.

50:44

Thanks.

50:45

Alright, Jesse, we have another one for you.

50:48

When do you anticipate that you can offer this to our investigative partners?

50:54

So more or less our validation is completed.

50:57

So we just have to write up a few more documents.

51:00

Obviously, so the kind of a null part of this is being able to take this NGS

51:05

data, parse

51:06

out a major contributor and a minor contributor and then obviously offer some

51:09

sort of, you

51:10

know, type and ancestry analysis on those.

51:12

So we'd like to have, you know, maybe a publication or something like that here

51:16

in the coming

51:17

months or whatnot.

51:18

But more or less, we're pretty close to being able to offer this to

51:22

investigative agencies.

51:23

Wow, that's fantastic.

51:25

Alright, and one more question for actually both of you before we call it a day

51:31

on this

51:31

webinar.

51:32

Jesse, we'll start with you.

51:34

Do you focus more on autosomal SMPs or YSMPs?

51:39

So for the system that we're using now, I mean, assay that we're using now, all

51:43

of

51:44

these ancestry SNPs and phenotypes SNPs are located on the autosomal

51:47

chromosomes.

51:48

You know, certainly if you're getting into more of the genealogy types of

51:53

studies, you're

51:54

going to broaden the number of SNPs that you're looking at.

51:57

And then when you're kind of getting into that, you can start looking at SNPs,

52:00

which

52:00

are located on X and Y respectively.

52:03

So at the current moment in time, everything that we're doing right now is more

52:07

focused

52:07

on the autosomal stuff, but certainly YSMPs are an area of interest.

52:11

It's to be kind of expand our testing more.

52:14

Marta, what about you?

52:16

Definitely on Y, which was always my biggest interest in the autosomal one.

52:23

And in everyday work, I also work with phenotypes, I think, as Jesse.

52:29

So Y chromosomes are of interest for me, also from the phylogenetic perspective

52:35

So definitely Y chromosomes.

52:37

Gotcha.

52:38

Okay.

52:39

Thank you guys.

52:40

All right.

52:41

Audience, that about wraps up all the time we have for today.

52:44

I'd like to thank our fantastic presenters, as well as thermofuscious

52:48

scientific, who's

52:49

a sponsor for not only today's webinar, but the entire 2021 Future Trends and

52:54

DNA Technology

52:55

series.

52:56

In 24 hours or less, this webinar will be available on demand if you'd like to

53:00

watch

53:00

it again or share it with friends and colleagues.

53:04

In fact, the entire 2021 series is available on demand on the forensic website,

53:09

www.ferensicmag.com.

53:12

Additionally, later today, you will receive an email with information on how to

53:16

obtain

53:17

CE credit documentation for your participation in this webinar.

53:21

And while this unfortunately concludes the 2021 webinar series, forensics and

53:25

thermofuscious

53:26

scientific are thrilled to continue this informative series next year.

53:29

So be on the lookout for our notice in early 2022.

53:32

Thank you so much, everyone, and have a great day.