Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Predicting first-degree relationships from ancient samples using deep neural networks
Download
mervethesis.pdf
Date
2023-8-25
Author
Güler, Merve Nur
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
375
views
245
downloads
Cite This
Estimating genetic relatedness between individuals using genomic data from ancient samples is of utmost importance; nonetheless, almost all current tools only distinguish between first, second, and third-degree relationship categories. The ability to distinguish between these two first-degree relationship categories is vital for investigating long-gone cultural practices. This study aims to differentiate between parent-offspring and sibling pairs using a Convolutional Neural Network (CNN) model in low-coverage ancient genomes. This study began by simulating founders using the population genetic simulator msprime and the pedigree simulator PedSim to create sibling and parent-offspring pairs under realistic demographic scenarios. Then, ancient DNA simulation was applied to obtain NGS (Next Generation Sequencing) reads similar to ancient genome reads by using Gargammel software. Next, using the mismatch rate, the coefficient of relatedness (r) was estimated across genomic windows containing 200 SNPs, i.e., the probability that two alleles at a given locus are identical by descent. Two-dimensional binning was applied on r values, and a CNN model was trained using the resulting fixed-length vectors for each pair. The model was tested under scenarios of different numbers of shared SNPs between parent-offspring, sibling, and unrelated pairs and achieved 1, 0.98, 0.89, 0.86, and 0.62 macro-average F1 scores for pairs sharing 50,000, 20,000, 10,000, 5,000, and 1,000 SNPs, respectively. This study demonstrates the potential for applying deep artificial neural network models to differentiate between first-degree relationships in low-coverage ancient genomes precisely and provides a foundation for future research in this field.
Subject Keywords
Ancient DNA
,
Kinship
,
First-degree Relatedness
,
CNN
,
Genetic relatedness
,
First-degree relatedness
,
Machine learning
,
Deep learning
,
Deep neural networks
,
Convolutional neural network
,
Genome simulation
URI
https://hdl.handle.net/11511/105279
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
M. N. Güler, “Predicting first-degree relationships from ancient samples using deep neural networks,” M.S. - Master of Science, Middle East Technical University, 2023.