The growing use of NGS in forensic genetic applications has resulted in the analysis of alternative markers that can complement CE-STR results for challenging samples- bones, mixtures, or degraded samples- to generate investigative leads. Marta Diepenbroek will discuss her collaborative work with colleagues from Poland and Austria, which concerns the first genetic and phylogeographic evidence for Jewish Holocaust victims found at the former Nazi death camp in Sobibór. Diepenbroek will discuss show how forensic analysis of haploid markers can shed light on historical events. Jesse Cox will describe the underpinnings of an approach to deconvolute mixtures of two individuals to provide ancestry and phenotypic information via NGS for both a major and minor contributor. In this webinar you’ll hear about: NGS analysis of the mitochondrial genome can be combined with CE-based Y STR markers to enable lineage determination of remains SNP analysis by NGS is used to deconvolute mixtures for investigative lead generation
0:00
Hello, and welcome to Breakthroughs and Forensic DNA Analysis using Next
0:04
Generation Sequencing,
0:06
brought to you by Forensic and sponsored by Thermo Fisher Scientific.
0:09
This is the fifth and final webinar in the 2021 Future Trends and Forensic DNA
0:14
Technology
0:15
Series.
0:16
My name is Michelle Taylor, Editor-in-Chief of Forensic and I will be your
0:18
moderator throughout.
0:20
For today's webinar, you can earn one hour of continuing education credit.
0:24
Following the conclusion of the webinar, you will receive an email with
0:27
information on
0:28
how to obtain CE Credit Documentation.
0:31
We have a great lineup scheduled to present to you today, but before we begin,
0:35
I'd like
0:35
to take just a moment to cover a few logistics.
0:38
At the end of the presentation, we will hold a question and answer session.
0:42
To ask a question, click on the "Ask a Question" tab in the upper right corner
0:46
of
0:46
your screen.
0:47
Please also take note that the right side of the screen features an overview of
0:50
today's
0:50
webinar as well as more information about our speakers.
0:54
If you have a technical question during today's event, click on the "Test your
0:57
Connection"
0:58
button on the bottom of your screen.
1:00
From there, you can access additional webinar support.
1:03
We also invite you to use a social media widget beneath the webinar to share
1:07
with your friends
1:07
and colleagues.
1:09
Today, you will hear from Marta Deepenbroke, a post-doctoral scholar at the
1:14
Ludwig Maximilian
1:15
University in Munich.
1:17
Marta will discuss her collaborative work with colleagues from Poland and
1:20
Austria concerning
1:21
the first genetic and phylogeographic evidence for Jewish Holocaust victims
1:26
found at the
1:26
former Nazi death camp in Sobobor.
1:30
For the past six years, Marta was a team member of the Polish genetic database
1:34
of victims
1:35
of Tolitarianisms, a DVI project with the purpose of employing forensic methods
1:40
to identify
1:41
victims of communist and Nazi regimes in Poland.
1:44
Currently, her target is the practical use of state-of-the-art forensic methods
1:49
in routine
1:50
investigations and presenting them to law enforcement.
1:54
Her work focuses on the newest MPS-based solutions.
1:57
We'll also hear from Jesse Cox, assistant professor at the University of
2:02
Nebraska Medical
2:03
Center.
2:04
Jesse will describe the underpinnings of an approach to de-combolute mixtures
2:07
of two individuals
2:09
to provide ancestry and phenotypic information via NGS for both a major and
2:14
minor contributor.
2:16
Jesse completed his AP/CP Pathology Residency and Molecular Genetic Pathology
2:21
Fellowship
2:22
training at the University of Nebraska Medical Center in Omaha.
2:26
He currently serves as director of the Human DNA Identification Laboratory and
2:30
HLA Tissue
2:31
Typing Laboratories at UNMC.
2:34
Thank you for joining us for the final webinar in the 2021 Future Trends in
2:38
Forensic DNA
2:39
Technology Series, and we hope you have enjoyed this series as much as we have.
2:44
Be sure to check your email earlier in the new year, because we will be
2:47
continuing this
2:48
series.
2:49
Now, without further ado, let's get underway.
2:52
Welcome, everyone.
2:53
My name is Matha Deepindrok.
2:55
I currently work as a product researcher in forensic genomics and the Institute
3:00
of Human
3:01
Disease at LMEU.
3:02
Today, it is my pleasure to present the results of a collaborative work I would
3:09
call it from
3:10
Poland and Austria.
3:11
And during my presentation, I will discuss how our genetic and phylogenographic
3:17
analysis
3:18
shed light on the historical events and the solvable death camp, because I
3:23
believe that
3:24
as forensic scientists for the death and the living, we must bear witness.
3:30
According to the last year's survey, conducted among young Americans, almost 70
3:36
% of them
3:36
are not aware how many Jews were killed during the Holocaust and what is more
3:42
some stated
3:43
that it is actually Jews who caused the Holocaust.
3:47
Almost 50% of the respondents were not able to name a single concentration camp
3:53
and almost
3:54
a quarter of them called the Holocaust a myth.
3:59
The Holocaust was the systematic bureaucratic state sponsored by the
4:04
persecution and murder
4:06
of 6 million Jews by the Nazi regime and its collaborators.
4:11
It was practiced by locking Jews in gaztos, by executing them in 12th ground,
4:16
by forcing
4:17
them to work in labor camps and by mass killing them in death camps.
4:24
The last were the most secret yet the deadliest form of the Holocaust.
4:28
In contrast to the better known concentration camps, like for example Auschwitz
4:33
, dead camps
4:34
served only one purpose.
4:36
They were created to carry mass scale mergers.
4:40
People who arrived at them were sent direct people gaztrenders and the
4:44
existence of these
4:45
places was kept secret even within the Nazis.
4:49
Therefore, so Ivar is a place that many never heard about.
4:53
It was built in spring 1942 as one of the three camps operating as a part of a
5:00
secretive
5:01
opposition, Reinhardt, which was a plan to kill most Polish Jews in the general
5:06
government
5:07
district.
5:08
This single mass murder campaign took lives of almost 2 million people.
5:14
In Sobibor itself almost 200,000 people were killed.
5:19
A group of resistance organized a revolt which led around 300 prisoners to
5:25
freedom.
5:26
As an aftermath the camp was closed and completely demolished, leaving no trace
5:31
of its existence.
5:34
Only 58 of those who escaped survived the world.
5:39
Due to the fact that the Nazis destroyed any evidence of Sobibor's presence,
5:44
the only
5:45
sources that could testify about the place where the aforementioned survivors
5:50
and the
5:51
guards who walked at the camp.
5:53
However, both sources were unfortunately not fully reliable.
5:57
The prisoners who escaped from Sobibor never saw with their eyes the deadliest
6:02
part of
6:02
it, so-called "laga trite" or in English "camp 3".
6:07
Namely the place where the gas chambers were located and where the bodies were
6:12
burnt and
6:12
burnt.
6:13
During the revolt, prisoners operating this part of the camp would not be
6:18
informed about
6:19
the upcoming events and therefore none of them survived the escape.
6:25
And as for the guards, the sincerity of their testimony could naturally be
6:28
caught in the
6:29
question.
6:31
This slide shows an aerial photo taken by Luftwaffe almost a year after the
6:37
camp was destroyed.
6:38
As you can see, except over a few buildings that could not be linked to any
6:43
criminal activities,
6:45
the place was torn to the ground.
6:47
Based on the available testimonies, the potential layout of the camp was
6:52
reconstructed.
6:53
However, new evidence still keeps coming to the light.
6:57
Just last year, some private photos from one of the Nazi guards working at the
7:03
camp were
7:03
found hidden at the attic.
7:05
The pictures were taken mostly during everyday activities and they show only
7:10
the Nazi residential
7:12
buildings, but even there, the historians were able to spot important evidence.
7:17
Like for example, here in the background we can see an excavator which was used
7:23
to dig
7:24
out the already buried decaying bodies in order to burn them as it was required
7:31
by their new
7:31
regulations.
7:33
Therefore, in order to learn more about the camp's layout, archaeological work
7:38
was carried
7:39
out at the place, starting in the year 2000.
7:42
It led to the discovery of mass graves with cork wax and human ashes and
7:48
archaeological
7:49
signs with the potential relics of the gas chambers.
7:53
The excavations continued as non-invasive workers and later in 2007, systematic
8:00
excavations
8:01
began which led to the discovery of the lagatite relics.
8:06
Due to a planned commemoration of the victims of the camp, further work at the
8:12
camp 3 was
8:12
carried out.
8:13
And this led to the discovery of foundations that undoubtedly belonged to the
8:19
former gas
8:19
chambers.
8:20
However, surprisingly, also intact human skeletons were revealed.
8:27
Here is a picture taken by the archaeologists working in the camp showing the
8:34
view on the
8:35
lagat and rice prepared for the commemoration of the victims.
8:42
And here are all the mass graves previously located at the camp 3.
8:48
However, unexpected skeletal burials were also found on the outskirts.
8:54
And expected as according to the available testimonies, all the Jewish victims
8:59
killed
8:59
at the camp were cremated.
9:03
The pictures showed two of the four skeletal graves recovered at the camp.
9:10
On the left, on the picture seen on the right is present at Vojik Manzurak, an
9:15
archaeologist
9:16
who led the excavations and with whom we collaborated during this project.
9:21
Later with his colleagues, when analyzing the discovery, they observed that the
9:26
layers
9:27
of the burials were crossing the camp ground players, which suggested that the
9:32
victims were
9:33
buried after the end of the criminal activity of the camp.
9:37
Additionally, the historians working on the case found a report on a criminal
9:43
activity
9:44
in the camp already after the war.
9:47
The remains of the villages in the area witnessed shops which were connected to
9:52
the crimes committed
9:53
by communists urging.
9:56
Therefore, it was assumed that the remains belonged to Polish partisans, killed
10:02
secretly
10:02
in the 1950s by communists.
10:05
The investigation was taken over by the Institute of National Remembrance and
10:10
based on their
10:11
decision, the remains recollected and sent to the Polish genetic database of
10:15
totalitarian
10:16
victims, a humanitarian project led by the stretching and forensic institute.
10:23
According to the procedures used in the project, first the remains were
10:27
submitted to anthropological
10:28
and medicalical assessments.
10:31
All 10 excavated skeletons were attributed to men who were estimated to be
10:37
between 20
10:38
and 60 years old.
10:40
Our scars showed traces of injuries caused by a firearm and once teleported on
10:46
the
10:46
plate signs of injuries caused by a gunshot to the chest.
10:50
All entry ones had a diameter of between 7 and 8 millimeter.
10:56
The localization of the entry ones pointed to a systematic way of execution of
11:01
the victims.
11:03
Shorts were aimed to the back of the head or near the neck and this pattern
11:08
indicated
11:09
that the bullets were fired from a shot weapon to a victim lying down or
11:15
kneeling with the
11:17
head strongly inclined downwards.
11:20
Altogether, more than 100 other parts were recovered during the field work in
11:27
Soviet
11:27
work.
11:28
Most of these included pieces of metal and fabric, which did not carry any
11:34
specific information
11:35
about the victims, however, some objects appeared to be some personal
11:40
belongings like, for
11:42
example, a pocket knife or a leather shoe.
11:46
And three of the artifacts found during the archaeological work were submitted
11:52
to ballistic
11:52
analysis.
11:53
Those were to grasp pistol shells and one iron made pistol projectile.
12:00
Two shells were assigned to a German company producing weapons during the
12:06
Second World
12:07
War and both shells were of caliber 7.65 millimeter and could be assigned the
12:15
brownie
12:16
system, which was difficult for self-loading pistols.
12:21
These hand guns were standard equipment of German guards in concentration and
12:26
death guns.
12:27
However, they were also used by the partisans after the war.
12:33
Bon samples were submitted to their lab workflow used in the project, which
12:38
after the bon preparation
12:40
focused on the golden standard of forensic DNA typing done with capillary elect
12:46
rophoresis.
12:46
This included also haploid marker analysis, NDB1 and NDB2 of the micro-hondrel
12:52
DNA were
12:53
sequenced with sandal and because of a high degradation of the samples, hapl
12:57
otypes were
12:58
obtained for half of the individuals.
13:00
But rather these results look interesting as three of the haplotypes were
13:06
assigned by
13:07
m-pop to the K haploid group, which is not typically expected in Poland,
13:12
especially not
13:13
the particular K1A damage.
13:15
This obviously literature search, which pointed at the K1A limit, is one of the
13:22
four most common
13:23
micro-hondrel haploid groups found among modern Ashken algorithms.
13:30
Based on those findings, the white chromosome on the data was also analyzed
13:34
from phylogenetic
13:35
perspective.
13:37
We started with only heads, namely, YSCR profiles, and we used them to perform
13:42
haploid
13:42
group estimation using a Neptune predictor.
13:46
Again, for eight out of 10 individuals, the predicted haploid groups are not
13:52
typically
13:53
found in Poland.
13:54
In fact, they are rather infrequent among all Europeans and found frequent big
14:00
among the
14:00
Ashkenazi, especially the J1-h
14:10
among Ashkenazi.
14:13
And looking at the haplotypes predicted as J1A, we observed that they are
14:18
almost identical
14:19
for all four individuals.
14:21
An observation concerning this particular similarity was done by other
14:25
scientists already in the
14:27
late '90s, when they studied genetics of Jewish priests.
14:34
The group of interests were Jewish Kohanim, who are believed to be of direct
14:35
patriena
14:41
are destined from the De Bicah Aron.
14:44
Study of their white chromosome revealed that the most frequent haploid group
14:49
observed among
14:50
the group is J3-58, and that almost all of them show even the same haplotype
14:57
called later
14:57
their co-hen model haploid types.
15:01
Using all of these preliminary observations left to our decision, that our
15:05
analysis should
15:06
be expanded.
15:07
For this matter, we established a collaboration with a group from INSCROC led
15:12
by Professor
15:13
Viter Tarzinn.
15:14
We plan to extend the mitochondrial analysis with performing the whole
15:19
mitochondrial genome
15:20
sequencing using the next generation sequencing.
15:24
And for the white chromosome, we decided to design primers for the white SNPs
15:28
suggested
15:29
by Nefter and to analyze them with Sanga sequences.
15:35
And the whole mitochondrial genome sequencing was done using their precision ID
15:39
mitochondrial
15:40
DNA whole genome panel and the ion-s system.
15:44
We used multiple extracts for each individual, as they were all highly bed-
15:49
related, with
15:50
some showing less than 100 micro copies per microliter.
15:55
Nevertheless, we were successful and we obtained whole microgenomes for all 10
16:01
individuals,
16:02
which were assigned to AMPOP to eight different haplogroups.
16:07
Since the largest published, Ashkenazi mitochondrial DNA datasets are based on
16:12
controlled region
16:13
sequences, we used those to estimate relative frequencies of their haplogroups
16:19
observed
16:20
among Ashkenazi groups of different European origin and we contrasted them to
16:26
the distribution
16:27
of the same haplogroups among corresponding European populations.
16:32
The data showed that all of haplogroups established for the study individuals
16:38
and corresponded
16:39
to a lineage found in Ashkenazi while being rare or missing in other European
16:45
populations.
16:46
To do a more specific search, we queried the obtained control region haplotypes
16:52
against
16:53
AMPOP database, which we called almost 40,000 worldwide control region
16:59
sequences.
17:00
And the comparative search was also performed for the microgenome data querying
17:05
a dataset
17:06
of almost 30,000 worldwide microgenomes, downloaded from gene bank.
17:11
All individuals got matches, which consisted of or at least included samples of
17:18
non-Ashkenazi
17:19
origins.
17:20
Since the original haploteases assume that the remains belong to Polish
17:25
partisans, further
17:27
phylogenographic analysis were performed on a selected set of Polish microtypes
17:32
and on
17:33
available Ashkenazi microgenomes.
17:36
For each haplogroup observed among the study individuals, we prepared a
17:41
separate dataset
17:43
which consisted of haplotypes and interests, their Polish and the Ashkenazi
17:49
microgenomes
17:50
and we performed multiple sequence alignments in the DIGS.
17:54
This analysis revealed that four of the established haplogroups clustered with
18:00
known Ashkenazi
18:01
images and the mitochondrial DNA results of finding the study had special
18:06
meaning since
18:07
in Judaism, materialina and descent is of a special importance.
18:13
As for the paternal lineage, while the Sunday is deciding on which SNP sequence
18:18
we looked
18:19
at both national predictions and at the markers mentioned by literature.
18:25
As the data is evolving and the phylogenetic number structure is changing.
18:30
The standard sequencing revealed and derived are used at all the analyzed signs
18:37
thereby
18:37
confirming that most of the lineages observed among the study individuals are
18:43
infrequent
18:44
among Europeans, especially the J358 lineage.
18:49
Two white chromosomes were assigned to haplogroup R, which is frequently
18:53
observed in both
18:55
Ashkenazi and other European populations.
18:57
However, studies show that a particular L1A lineage, namely M at 5H2 that we
19:05
observed
19:05
now study, is extremely rare in non-Jewish populations.
19:11
As it was confirmed that four individuals assigned to J
19:12
haplogroup belong to their J58 lineage, we continued our work concerning the C
19:22
auhan
19:22
model haplotype.
19:23
The first published haplotype was based on only six YSTR markers, presented
19:29
here on the
19:30
slide, and our STR data included five of these markers, and all the observed
19:37
values were fully
19:38
recorded with the Cauhan model haplotype.
19:41
A later study in the topic resulted in extending the markers set to 12 YSTRs,
19:49
and when comparing
19:50
our data, we observed that two of four individuals still show a match with the
19:57
haplotype and interest.
19:59
But in order to perform a full comparison, we were missing a Lele data for two
20:06
YSTRs,
20:07
which are not included in any modern commercial forensic technique.
20:12
And therefore, we sequenced the markers in interest with Sangha, and as a
20:19
result, we obtained
20:20
a full match with the extended Cauhan model haplotype for two of the study
20:25
individuals,
20:26
and the remaining ones were close neighbors.
20:30
Our genetic and phylogenographic analysis were submitted to the prosecutor's
20:35
office from
20:36
the Institute of National Remembrance, who led them in the investigation
20:41
concerning the
20:42
remains.
20:43
Based on that, the responsible authorities ordered every burial of the remains.
20:49
As seen on the photo, the ceremony was led by a rabbi, and it was held
20:55
following George
20:56
Wright.
20:57
The victims were buried in separate graves at the place of their discovery.
21:04
Earlier this year, the results of this collaborative interdisciplinary study,
21:09
including specialists
21:10
from history, archaeology, anthropology, statistics, and parenthesis, were
21:15
published in genome
21:16
biology.
21:17
Since then, we continuously endeavored to promote our findings, as we agreed
21:23
with one of the
21:24
sub-uberance survivors' sets, education is the key to a better world.
21:30
I had an honor to present the results of the work of many great specialists
21:34
from different
21:35
fields, and I would like to thank all of them for collaborating on this project
21:42
Thank you for your attention.
21:50
I am happy to answer any questions.
21:53
Hello.
21:54
My name is Jesse Conchsen.
21:55
I am the director of the Human DNA Identification Laboratory at the University
21:58
of Nebraska Medical
21:59
Center in Omaha, Nebraska.
22:01
Today I'm going to discuss our recent development and work towards using next-
22:05
generation sequencing
22:06
technology to analyze a two-person mixture of ancestry and phenotype polymorph
22:11
isms, and
22:11
their decombulation to provide investigative leads.
22:16
I have no competing financial disclosures to report.
22:22
In this talk, I will discuss the reasoning for addressing this problem, the
22:26
theory and
22:26
logic behind our approach and the development of our corresponding tool, the
22:30
process we
22:31
used in validation, and lastly, how we can use this tool in order to assign
22:35
ancestry and
22:36
phenotypes to two individuals that have contributed to a single specimen.
22:43
The central question our laboratory has had was whether we could use ancestry
22:47
and phenotype
22:47
testing to help investigators bolster or refute potential investigative leads.
22:53
Using this information, could we provide our partners with potential physical
22:56
characteristics
22:57
that a person of interest may have?
23:03
Welcome to this question, and underpinning biochemistry and molecular biology
23:07
is the principle that
23:07
DNA is transcribed into RNA that is then translated into proteins.
23:12
Further, this information encoded in the standard nucleotides is both
23:16
simultaneously unique to
23:18
an individual but is passed from generation to generation.
23:24
As we are all aware, this genetic material is progressively packaged within
23:28
each nucleated
23:29
cell and inherited from mother and father.
23:34
Speaking toward the single nucleotide polymorphism and other genetic
23:37
alterations that are inherited
23:39
and at the same time make us unique, certain SNPs have been identified in which
23:43
particular
23:44
alleles are more commonly found in individuals of certain ethnicities.
23:48
In the pictured reference SNP example, the reference allele C is seen
23:53
approximately 99%
23:56
of persons of Asian descent, whereas nucleotide C is seen only about 4% of the
24:02
time in persons
24:04
of European descent.
24:05
Thus, if a C is seen at this particular SNP, the individual will be much more
24:10
likely to
24:11
be of Asian descent as compared to European descent.
24:18
With this principle in mind, assessment of SNPs across the genome with varying
24:22
frequency
24:23
and peoples of differing geographic origins may be used to imply where a
24:27
particular person
24:28
may have descended from.
24:30
In fact, this type of approach is how many of the ancestry services go about
24:35
providing
24:36
detailed ancestry data to their patrons.
24:43
To bring this type of testing in-house, we assessed at the time the various NGS
24:47
platforms
24:47
available to determine ancestry.
24:50
We identified the ion torrent S5 platform as being suitable for our purposes,
24:55
running
24:55
the Precision ID ancestry panel.
24:58
Our human DNA identification laboratory works alongside our molecular diagn
25:02
ostics laboratory
25:03
at UNMC, which was using ion torrent technology as well, giving us some
25:09
familiarity with the
25:10
strengths and weaknesses of the platform.
25:13
Further, the ion chef, Lipid Handeler, enabled West Hands-On Time for our techn
25:18
ologists so
25:18
we could better receive our workflows.
25:21
The panel we chose to assess was the Precision ID ancestry panel based off
25:25
ancestry-focused
25:27
SNPs identified and characterized by doctors' kid and selden.
25:34
From this panel, the broad regions identified and ancestry testing included
25:39
those in the
25:40
figure on screen, including America, Europe, North Africa, Middle East, Sub-
25:46
Saharan Africa,
25:47
South Asia, East Asia, as well as Oceana.
25:52
These data would then allow us to intentionally interpret the hands-sustery of
25:56
an individual
25:57
from the geographically distinct areas.
26:02
As discussed earlier, genes encoded in genomic DNA is transcribed into RNA and
26:07
subsequently
26:07
translated into protein, to building blocks that give rise to our physiological
26:12
and physical
26:12
characteristics.
26:14
To respect the question, could certain SNPs be identified which could predict
26:18
certain
26:18
traits, such as hair color and eye color?
26:23
And the obvious answer to the prior question is an unequivocal yes, and the
26:26
tools are freely
26:27
available online to perform such predictions.
26:31
Screen captures from one such tool, the Kaiser 6 blocks are shown on the screen
26:35
now.
26:36
Using six different SNPs, eye color, blue versus brown versus a mixture of the
26:40
two, can
26:41
be predicted dependent on the genotype input into the tool.
26:48
Broadening this notion out, as applicable to the platform we chose for our NGS
26:52
testing,
26:52
we identified and validated the ion, amplicity, DNA phenotyping panel to
26:56
provide phenotype
26:58
information in addition to our ancestry panel.
27:01
Based upon the work out of the Kaiser and Walsh Laboratories, the set of 17 SNP
27:06
targets
27:07
can be used to predict eye color, hair color and hair tone.
27:14
To validate NGS for use in our laboratory, 83 samples were tested over the
27:17
course of approximately
27:19
2,000 runs.
27:20
As part of our validation, we included members of our institution with varying
27:24
ethnicities,
27:25
reference materials, as well as various specimen sources.
27:30
These data were then analyzed to assess for precision and accuracy, reproduc
27:34
ibility, as
27:35
well as sensitivity.
27:37
We were also very interested in assessing whether we could deconvolute mixtures
27:41
as we
27:41
oftentimes encounter samples with more than one contributor.
27:46
To specifically assess precision and accuracy, five individuals were run
27:52
multiple times.
27:54
Resulting genotypes were at worst 97% identical between the individual runs.
28:00
Phenotypes of the ancestry is found to be 96.8% concordant with self-reported
28:05
ancestry,
28:06
with the most pronounced deviation occurring in those of Hispanic descent.
28:11
Phenotype was concordant in 93% of runs.
28:17
We next assessed sensitivity of the assay through the use of serial dilution of
28:22
input
28:22
material.
28:23
We were able to determine the optimal DNA input range to be in the 0.2-2 nan
28:28
ogram range.
28:29
Further, we assessed whether no template controls could be run to both examine
28:33
for contamination
28:34
as part of our standard workflows, as well as to help optimize our overall
28:39
workflow in
28:40
the laboratory.
28:42
Because the sequencing used, because the chips which are used for sequencing
28:46
are designed
28:47
to accommodate multiple individual samples, a balance between batching samples,
28:54
as well
28:55
as to accommodate reasonable turnaround times was an important consideration
29:00
for us.
29:00
We were pleased to determine that no template controls or blanks could be run
29:04
without affecting
29:05
overall results on a partially filled chip, which can therefore bolster both
29:10
our workflows
29:11
as well as our turnaround time.
29:15
We also wanted to confirm that a variety of specimen sources would be suitable
29:19
for this
29:19
type of testing.
29:21
Because of this, we assessed standard samples such as buckle swabs, blood, as
29:25
well as hair
29:26
roots.
29:27
We were also able to confirm that other sources would work as well, including
29:31
saliva, semen,
29:32
degraded tissues, as well as touch DNA.
29:37
These data gave us confidence that this platform would work for sources
29:40
commonly encountered
29:41
in the molecular forensics laboratory.
29:46
The last major area we were very interested in assessing and validating
29:49
concerned the
29:50
assessment of ancestry from mixtures of two individuals.
29:54
At the very least, we were interested in identifying ancestry of a major
29:58
contributor
29:58
with identification of the minor contributor being a bonus.
30:03
During our validation, we prepared mixtures of individuals at different ratios
30:08
between
30:08
one to one down to ten to one.
30:11
Using the assay supplied HID SNP genotype route, we could generally assign
30:16
ancestry to a major
30:17
contributor, especially at ratios between three to one and ten to one.
30:22
Importantly, however, the minor contributor was never able to be determined.
30:28
To illustrate why assignment of a minor contributor could not occur, we can
30:32
list how a reference
30:33
in alternate fields should be observed given various scenarios concerning the
30:37
homozygous
30:38
heterozygous status, both a major and minor contributor.
30:42
As you can see in the table to the right, we can, depending on the scenario,
30:48
determine
30:49
whether or not an individual will have either 50% between reference and
30:54
alternate if their
30:55
heterozygous or 100% for reference or 100% for alternate depending on the
31:02
status.
31:03
Now if we expand this table, and we take this scenario list and expand it, let
31:08
's suppose
31:08
in ratio of a major and minor, in this case, say an 80 to 20 ratio, we can
31:14
develop predicted
31:15
varying allele frequencies for a reference and an alternate allele to occur
31:20
given a particular
31:21
scenario.
31:22
So for example, in this 80 to 20 mixture of major and minor, where the minor
31:26
contributor
31:27
is heterozygous and the minor contributor is homozygous for the reference, we
31:32
would predict
31:33
that the varying allele frequency of the reference allele should be
31:37
approximately 60%.
31:38
We play a similar game, and we assume that the major is homozygous for the
31:42
alternate
31:43
allele, whereas the minor is homozygous for the reference allele, we would
31:49
expect the
31:50
reference allele to be present at 20% at the time.
31:54
Taking a step in the other direction, we can use the observed variant of the
31:57
allele frequency
31:58
of a reference allele determined by the equations in the rightmost column where
32:02
R equals percentage
32:03
of major to predict which scenario was in place with regards to the genotype of
32:08
the major
32:09
and minor contributor.
32:13
Using these predictions, we developed a tool that given a percentage of major
32:17
to minor
32:18
can ingest coverage data produced on our S5 and output corresponding genotypes
32:23
for a
32:23
major contributor and the minor contributor.
32:26
In the screen captures presented, a 5 to 1 ratio is analyzed, and we were able
32:30
to determine
32:31
a number of genotypes which differ between the major and the minor.
32:37
To verify our genotype deconvolution tool could reasonably perform and compare
32:43
deconvolution
32:44
results of major and minor to genotype results, and the contributors were run
32:48
as our gold
32:49
standard of a single source specimen.
32:52
We looked at the accuracy of ratios ranging from 2 to 1 out to 10 to 1.
32:57
Results are graphically represented on the left with the top line representing
33:01
the major
33:02
and the lower representing minor contributors.
33:05
The table on the right shows the number of correct alleles with the maximum
33:09
correct being
33:10
100 and maybe 7.
33:12
Also presented as the average number of alleles which were collected and I
33:15
correctly call
33:16
them along with standard deviation.
33:19
Lastly, the percentages reflect the number of alleles with a denominator of 107
33:26
Overall the decongolition tool performs best at ratios between 4 to 1 and 9 to
33:31
1 with up
33:32
to 92.5% of the minor genotype being correctly assigned.
33:37
With these data, we next want to determine whether we could ascertain useful
33:41
phenotype
33:41
and ancestry interpretations from the decongulated data.
33:46
To do so, we developed tables in order to ingest our genotype data to assign an
33:55
ancestry.
33:55
Our first goal was to recreate the Bioinformatics tools provided in the HID SNP
34:00
genotype route.
34:01
At the same time, we sought to further delineate some of the larger geographic
34:05
regions such
34:06
as the America regions in order to give information which was a little more
34:12
granular.
34:13
To do this, we collected SNP frequency data for different geographic regions
34:16
from the
34:17
alpha database, the HGDP database, the page database and from Alfred.
34:23
The frequency tables from these databases were then used in conjunction with
34:27
the Hardy-Wine
34:27
group formulas to assess likelihood of a given SNP occurring in a person
34:32
descended from a
34:33
particular region.
34:35
Snips were then combined using the product rule and the geographic regions were
34:39
ranked
34:39
as displayed on the screen now.
34:44
Overall, the results of these analyses are then presented in the following form
34:48
with
34:49
the results using each database presented individually.
34:52
Dependent on the results, some databases are excluded.
34:56
For instance, assignment of a person under the European umbrella will lead to
35:03
the exclusion
35:04
of results from the page database as the page database does not have to be
35:09
represented.
35:10
If these data are presented most of the time, people of European descent will
35:14
be presented
35:15
as being of human descent.
35:18
Databases are then combined into an overall summary panel which includes 11
35:21
geographic
35:22
regions again to increase the granularity over their original genotype results.
35:27
In particular, the original America group has been expanded into Latin America
35:31
East which
35:31
includes places like the Caribbean, Latin America West which includes places
35:36
like Mexico, North
35:37
America as well as South America.
35:40
Finally, results are presented in a radar plot to better help the reader
35:45
visualize the output
35:47
for analysis.
35:48
In the example presented on screen, the major contributor is a Latin America
35:55
West descent
35:56
whereas the minor contributor is a European descent.
36:02
To assess how the ancestry calling performed from our tool, we compared the
36:06
results using
36:08
the former HID SNP genotype tool to our developed tool and found no significant
36:14
differences
36:15
when looking at single source specimens and those data are not shown.
36:19
We then assessed how to mix your deconvolution compared with assignment using
36:23
single source
36:24
specimen and self-reporting and found that the major was conchordant
36:28
approximately 91%
36:29
of the time with the minor contributor being conchordant approximately 50% of
36:37
the time.
36:38
Because we plan to use this tool to help with investigative leads, we looked at
36:42
whether
36:42
our misses were close geographically as people's of neighboring geographic
36:47
regions may share
36:48
physical characteristics.
36:51
So for example, if we incorrectly called somebody as being from Latin America
36:56
West when they
36:57
should have in fact been North American or vice versa.
37:01
Because we planned when loosening our astringency, accuracy increased 96% per
37:07
major and 82% for
37:09
the minor and also keep in mind this includes all data examined at ratios from
37:14
3.1 out to
37:15
10.
37:16
And especially at that 10.1 ratio, our data did not perform.
37:19
Our deconvolution tool did not perform very well.
37:24
We next wanted to look more specifically at ratios where the genotype deconv
37:27
olution tool
37:28
performed best.
37:29
Our accuracy rose substantially at the 6.1 level where we saw 100% concordance
37:34
in the major
37:35
ancestry and 83% in the minor.
37:38
Allowing again for our less stringent interpretation, accuracy was up to 100%
37:43
at the 91 and 61 levels
37:45
for both major and minor contributors.
37:52
In keeping with assessing for phenotype in addition to ancestry, we also
37:57
adopted informatics
37:59
tools to assign eye and hair color for both our major and minor contributor and
38:03
the results
38:03
into output performed similarly as our ancestry tool.
38:10
So overall, the tool we have put together to deconvolute mixtures of two
38:13
individuals
38:13
to provide ancestry and phenotype data for both a major and minor looks like
38:17
the image
38:18
on the screen.
38:19
As we've worked with this tool during its development, we have identified a
38:22
number of
38:23
rules and caveats for interpretation, but overall, we think that this will be
38:27
useful
38:27
for our investigative partners.
38:30
Specific rules, for example, include identifying a major contributor of African
38:34
descent, which
38:35
may skew results for a minor contributor.
38:41
In summary, next generation sequencing allows for the assessment of many
38:45
individual genomic
38:45
locations and assessment of SNP status of these locations and can be used to
38:51
assign ancestry
38:52
and phenotype information.
38:55
Using variant allele frequency of a reference allele allows us to predict which
38:59
genotype
38:59
scenario is occurring between a major and minor contributor and applying this
39:03
over many
39:04
SNPs can help us to assign corresponding interpretive data for both ancestry
39:09
and phenotype.
39:10
Lastly, we believe that this will be a valuable tool for our investigative
39:16
partners.
39:17
And with that, I'd like to thank you for your time and attention.
39:21
I would be happy to answer any questions you may have.
39:24
A big thank you to Martha and Jessie for those incredibly interesting
39:29
presentations.
39:30
If you have a question for either of them or even both of them, now would be
39:34
the time
39:34
to submit it through the Q&A panel that you see on your screen.
39:39
And while we wait for everyone to submit their questions, why don't we do a few
39:43
polling questions
39:44
really quickly.
39:46
Our first polling question is, does your agency utilize Y chromosome analysis?
39:51
Yes, YSTRs for sexual assault, YSTRs for homicide cases, YSTRs for mixture
39:59
analysis,
40:00
YSTRs for kinship familial analysis, Y screen for sample screening, or no?
40:07
Go ahead and get your answer in there.
40:11
The largest percentage is no, but then followed by yes, YSTRs for sexual
40:17
assault cases.
40:18
So that's really interesting.
40:20
All right, let's go to our next question.
40:24
Does your agency utilize NGS next-generation sequencing analysis?
40:29
Yes, for MTDNA analysis.
40:32
Yes, for SMP analysis.
40:34
Yes, for STR analysis.
40:37
No, but we do plan to do so in the future or just no.
40:41
Let's see what everyone said.
40:45
That's awesome.
40:46
It's no, but you do plan to do so in the future, which is really interesting
40:49
because
40:50
as we know, NGS is really taking off.
40:52
So that's interesting.
40:54
All right, our last one before we get into Q&A, would you like to receive more
41:01
information
41:02
on today's webinar topics?
41:04
Whether that be about YSTRs analysis, MTDNA analysis, using NGS or SMP analysis
41:10
using
41:11
NGS.
41:12
All right, thank you everybody so much for answering those questions.
41:18
Now we will get to the fun part, the attendee questions.
41:23
So Jesse, Martha, you guys still with us?
41:26
I think so.
41:28
Awesome.
41:29
All right, Jesse then, we will give you your first question.
41:34
Jesse, can you elaborate how you subdivided the Americas and how that made for
41:39
better
41:40
predictions in your research?
41:41
Yes, so when we were going through initially kind of setting up our frequency
41:46
tables and
41:47
whatnot, the first thing that we were trying to do was more or less recreate
41:51
what the
41:52
Bioinformatics platform we had already been using for the process of going
41:56
through that
41:57
whole de-contribution process.
41:59
Since we had the opportunity to kind of expand and look at some of these other
42:04
databases,
42:05
many databases especially like Alfred, for example, have a lot of granularity
42:09
when it
42:10
comes to the different ethnicities and peoples that they've done sequencing for
42:16
And so to kind of address some of the shortcomings, especially those folks
42:20
which were of Latin
42:21
American descent, we kind of took advantage of some of that granularity to help
42:26
kind of
42:27
generate and build out some of these more subdivided groups especially over in
42:31
the Americas.
42:32
So hopefully that's helpful.
42:34
Absolutely.
42:35
Now, how would you adapt this to other panels?
42:40
And so for other panels, a lot of it is just determining which SNPs are
42:44
interrogated and
42:45
looked at there.
42:46
Dependent upon which SNPs are there, what you'd have to go to is identify the
42:50
frequency
42:51
tables for those particular SNPs.
42:53
So in order to adapt the current tool that we have right now to say a new up-to
42:58
-date
42:58
or a different platform or something, there would be a lot of kind of heavy
43:02
lifting towards
43:04
addressing that and kind of getting to that.
43:06
But it's definitely doable, it would just take some time.
43:09
Okay.
43:10
Now another one if you, Jesse, Marta, before we move on to a few questions for
43:14
you.
43:14
So Jesse, how do you focus on the Y SNPs for a certain contributor, whether
43:19
that's major
43:20
or minor?
43:21
Do you have to perform STR analysis first and then focus on the certain alleles
43:26
from that
43:26
contributor?
43:28
So the way that we're envisioning that this workflow will work is we're going
43:32
to do traditional
43:33
STR analysis to establish a proportion of major to minor.
43:40
We're not currently using YSTRs or Y SNPs to kind of delineate between major
43:44
versus minor.
43:45
So it would just be more looking at those autosomal STR contributions and what
43:51
not to
43:51
establish that we have a two-person mixture and then from there what is the
43:54
ratio between
43:55
the major and the minor.
43:56
So hopefully that's helpful.
43:58
Absolutely.
43:59
Thank you.
44:00
I'm sure we'll come back to you.
44:02
But Marta, we want to ask you a few questions that we have from the audience.
44:06
The first one is, can you tell us in your study what were the biggest technical
44:12
issues
44:12
that you faced?
44:14
The biggest technical issue was actually with the bone material itself.
44:19
In a project we collected almost 800 bone samples and the ones collected in
44:23
probably
44:24
Boudoir.
44:25
One of the most liquid databases we had to work with, we implemented many
44:30
different DNA
44:31
extractions methods.
44:33
So we started with PrEPFiler BTA, then we continued with the organic funnel
44:38
chloroform
44:39
and still we were not happy with the results as you saw, the first results from
44:40
Sanga
44:46
were definitely incomplete.
44:48
So the next step was introducing the in-house methods from in-group for the DNA
44:54
extraction
44:55
that was lately published and luckily we had enough extract for each individual
45:02
to continue
45:03
with their next generation sequencing.
45:06
And we got home microgenomes and the Y chromosome data, but the SDR, the Y SDR
45:14
sequencing had
45:15
to be repeated many times and the mitochondrial and DNA was also extremely
45:20
degradation.
45:21
So it was a big technical challenge to even get the satisfying results.
45:26
Okay.
45:27
Can you comment and talk to us about how adding whole genome MIDO analysis by N
45:33
GS actually
45:34
aided in the identification that you were able to make?
45:38
Yes.
45:39
So without my Tohonjua, the homo-tohonjua genome, we would not get phylogenetic
45:44
resolution
45:45
that we wanted.
45:47
So the slides of Sanga showed that for three individuals, we got the prediction
45:51
of K-HAPLO
45:53
group, even though for two of them, the control regions of HV1 and HV2 allowed
45:59
only to predict
46:00
the HAPLO group SK, which was definitely not informative enough.
46:04
So thanks to the whole genome sequencing, we got a higher phylogenetic
46:08
resolution, first
46:09
of all, and second of all, when we were comparing the homo-tohonjua genome
46:15
sequences with the
46:17
reference data from Polish and Ashkenazi, Michael Genome, we definitely got
46:25
also a better
46:26
understanding of where the results fall in phylogenetically.
46:30
So we were able to cluster the micro-hondyl genome, which was not able only
46:35
based on the
46:36
control-avision sequence.
46:38
Okay, gotcha.
46:39
Now, what was the biggest challenge of publishing your results?
46:45
I think that the biggest challenge was how complex our study is, and even
46:52
though we, like
46:53
the major focus was on DNA, obviously, and I'm a forensic scientist, and all of
46:59
the results
47:01
come from a forensic scientist, we could not ignore how complex the history
47:07
behind our
47:10
findings.
47:11
So when writing the paper, I had to go very deep into the history of everything
47:17
, and we
47:17
had to work with archeologists and anthropologists, because we could not
47:22
present the results without
47:24
exploring the history of the findings.
47:28
We also wanted people to understand how important the results are and how big
47:34
the discovery
47:35
is, and this would not be possible without putting all of the historical
47:39
background to
47:40
it.
47:41
And this is something we just have a very typical forensics.
47:44
So introducing so much of history and archeology into our study.
47:51
Gotcha.
47:52
Okay, and that sounds like a fascinating article.
47:55
Can you remind the audience again what's the name of it is, and it's published
47:58
in genome
47:59
biology, is that correct?
48:01
Yes, it's published in genome biology in August this year.
48:04
It's genetic and phylogenetic evidence for Jewish Holocaust victims, a found in
48:09
Saudi
48:09
border, that can, so I hope that the paper includes everything I said and
48:16
beyond.
48:17
So I think it's great if people can really learn about the story, because I
48:25
think that
48:26
this is a part of history that many really don't know much about, and I think
48:31
that our
48:32
paper just tries to shed some light on it.
48:36
Absolutely.
48:37
And I have one more question for you before we throw it back to Jessie to round
48:39
this up.
48:41
You know, what's next for you?
48:42
Are there current efforts to identify the remains?
48:45
Are you yourself planning any more humanitarian projects concerning the remains
48:49
of the Second
48:50
World War?
48:51
This would be of course an outstanding achievement to be able to identify
48:57
people.
48:58
And this is something that of course we hope for you.
49:01
We don't know yet exactly if this would be possible.
49:05
Right now when we have the phylogenetic data and we know that the people of
49:11
Jewish origin,
49:13
we have to consider who the victims could be.
49:16
And we already know that there were 200,000 people killed in Saudi border, but
49:23
prisoners
49:24
who actually destroyed the camp were the prisoners from the other camps, which
49:28
was Treblin, our
49:29
800,000 people were killed.
49:31
So we are looking potentially at 1 million victims.
49:36
So I know that there are amazing tools in the forensics right now, in the
49:41
foreign industry,
49:42
geology, we have all of the extended analysis.
49:44
So I hope that one day this study moves forward because giving names back to
49:48
those victims
49:49
would be just something absolutely great.
49:53
And since this project concerns my previous workplace and my previous team, I
50:00
right now
50:01
moves to Munich and I work at the Institute of the Kalmani-sen in Germany.
50:06
And even though I focused more on implementing NGS into routine investigation,
50:12
I never stopped
50:13
thinking and talking about bonds.
50:15
So everyone who knows me also hear about this from me almost daily because this
50:20
is just
50:20
my greatest session in forensics.
50:22
And for a while already I'm working on another humanitarian project that was
50:27
starting my
50:28
institute with other colleagues joining in that was introduced in the
50:34
scientific identification
50:35
of the foreign German soldiers from the second forward.
50:39
Oh wow, that's interesting.
50:41
We'll definitely have to look out for that.
50:43
Thanks, Marta.
50:44
Thanks.
50:45
Alright, Jesse, we have another one for you.
50:48
When do you anticipate that you can offer this to our investigative partners?
50:54
So more or less our validation is completed.
50:57
So we just have to write up a few more documents.
51:00
Obviously, so the kind of a null part of this is being able to take this NGS
51:05
data, parse
51:06
out a major contributor and a minor contributor and then obviously offer some
51:09
sort of, you
51:10
know, type and ancestry analysis on those.
51:12
So we'd like to have, you know, maybe a publication or something like that here
51:16
in the coming
51:17
months or whatnot.
51:18
But more or less, we're pretty close to being able to offer this to
51:22
investigative agencies.
51:23
Wow, that's fantastic.
51:25
Alright, and one more question for actually both of you before we call it a day
51:31
on this
51:31
webinar.
51:32
Jesse, we'll start with you.
51:34
Do you focus more on autosomal SMPs or YSMPs?
51:39
So for the system that we're using now, I mean, assay that we're using now, all
51:43
of
51:44
these ancestry SNPs and phenotypes SNPs are located on the autosomal
51:47
chromosomes.
51:48
You know, certainly if you're getting into more of the genealogy types of
51:53
studies, you're
51:54
going to broaden the number of SNPs that you're looking at.
51:57
And then when you're kind of getting into that, you can start looking at SNPs,
52:00
which
52:00
are located on X and Y respectively.
52:03
So at the current moment in time, everything that we're doing right now is more
52:07
focused
52:07
on the autosomal stuff, but certainly YSMPs are an area of interest.
52:11
It's to be kind of expand our testing more.
52:14
Marta, what about you?
52:16
Definitely on Y, which was always my biggest interest in the autosomal one.
52:23
And in everyday work, I also work with phenotypes, I think, as Jesse.
52:29
So Y chromosomes are of interest for me, also from the phylogenetic perspective
52:35
So definitely Y chromosomes.
52:37
Gotcha.
52:38
Okay.
52:39
Thank you guys.
52:40
All right.
52:41
Audience, that about wraps up all the time we have for today.
52:44
I'd like to thank our fantastic presenters, as well as thermofuscious
52:48
scientific, who's
52:49
a sponsor for not only today's webinar, but the entire 2021 Future Trends and
52:54
DNA Technology
52:55
series.
52:56
In 24 hours or less, this webinar will be available on demand if you'd like to
53:00
watch
53:00
it again or share it with friends and colleagues.
53:04
In fact, the entire 2021 series is available on demand on the forensic website,
53:09
www.ferensicmag.com.
53:12
Additionally, later today, you will receive an email with information on how to
53:16
obtain
53:17
CE credit documentation for your participation in this webinar.
53:21
And while this unfortunately concludes the 2021 webinar series, forensics and
53:25
thermofuscious
53:26
scientific are thrilled to continue this informative series next year.
53:29
So be on the lookout for our notice in early 2022.
53:32
Thank you so much, everyone, and have a great day.