OpenMETU
Fine-tuning medical large language models for differential diagnosis: from synthetic data to real-world evaluation
Download
Ezgi_Cavas_Thesis.pdf
Date
2026-1
Author
Çavaş, Ezgi
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Privacy regulations limit access to large-scale, annotated Electronic Health Records (EHR), which is a major obstacle to training robust clinical natural language processing (NLP) models. Synthetic data offers a privacy-preserving alternative, but how well synthetic text works for fine-tuning large language models (LLMs) on real-world tasks remains an open question. This thesis presents a framework that uses synthetic patient summaries to fine-tune a medical LLM for multi-label disease diagnosis. The approach offers a cost-effective, privacy-focused way to build clinical diagnostic tools with minimal use of sensitive real-world data. The results show that synthetic data can be used to fine-tune medical models successfully. Such tools could also support hospitals that struggle with triage and patient overcrowding.
Subject Keywords
Clinical transformers, Synthetic data augmentation, Electronic health records, Computational phenotyping
URI
https://hdl.handle.net/11511/118704
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
E. Çavaş, “Fine-tuning medical large language models for differential diagnosis: from synthetic data to real-world evaluation,” M.S. - Master of Science, Middle East Technical University, 2026.