Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
ENHANCING SPLICE VARIANT PREDICTION: EVALUATING BIOINFORMATICS TOOLS AND THE IMPACT OF TRAINING DATA IN THE CONTEXT OF GENETIC DISORDERS
Download
Elif_Güney_Tamer_PhD_thesis.pdf
Elif Güney Tamer_Tez Teslim Belgeleri.pdf
Date
2026-1-15
Author
Güney Tamer, Elif
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
86
views
0
downloads
Cite This
Accurate identification of splice-altering genetic variants is critical for understanding disease mechanisms and improving clinical variant interpretation. Although deep learning–based splice prediction tools perform well for canonical splice-site variants, their ability to detect exonic splice-altering variants remains limited. This limitation is primarily due to the scarcity of experimentally validated exonic variants and model architectures optimized for canonical splice motifs rather than regulatory exonic regions. In this thesis, we systematically evaluate state-of-the-art splice prediction tools using independent, experimentally validated datasets, with a specific focus on exonic variant performance. We curated and assembled the largest validated exonic splice-altering variant dataset reported to date, including both pathogenic and benign variants. Benchmarking analyses revealed consistent performance degradation across tools for exonic variants compared to canonical splice-site mutations. To address this gap, we retrained the Pangolin deep learning model by explicitly incorporating validated exonic splice variants into its training data. While the retrained model did not surpass the overall performance of the original Pangolin model, it demonstrated improved sensitivity, stability, and reduced false-negative rates for exonic splice-altering variants, particularly at higher prediction thresholds. Notably, the model showed improved detection of variants located in exonic splicing enhancer and silencer regions (ESE/ESS). Overall, this study provides a comprehensive evaluation of current splice prediction tools, demonstrates the benefit of targeted retraining for exonic variant detection, and establishes a foundation for developing more accurate and clinically relevant splice-altering variant prediction models.
Subject Keywords
Splice site variants
,
Deep Learning
,
Clinical variant prediction
URI
https://hdl.handle.net/11511/118434
Collections
Graduate School of Informatics, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
E. Güney Tamer, “ENHANCING SPLICE VARIANT PREDICTION: EVALUATING BIOINFORMATICS TOOLS AND THE IMPACT OF TRAINING DATA IN THE CONTEXT OF GENETIC DISORDERS,” Ph.D. - Doctoral Program, Middle East Technical University, 2026.