Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
FACDO: Feature-Aware Conditional Diffusion Oversampler for Robust Minority Oversampling in Imbalanced Credit Risk and Fraud Classification
Download
FACDO Feature-Aware Conditional Diffusion Oversampler for Robust Minority Oversampling in Imbalanced Credit Risk and Fraud Classification.pdf
Onur Yaman Tez Belgeleri.pdf
Date
2026-1-22
Author
YAMAN, ONUR
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
149
views
0
downloads
Cite This
Datasets used in credit risk modelling and fraud detection are typically highly imbalanced, where defaults or fraudulent transactions constitute only a small fraction of all observations. Accurately identifying these minority events is crucial, since missed defaults lead to underestimated expected credit losses, inaccurate capital requirements, and loan portfolio mispricing, while undetected fraud causes direct financial losses and operational risk. In such settings, classical classifiers, particularly gradient-boosted decision trees such as LightGBM and XGBoost, often struggle because the binary cross-entropy objective is dominated by the majority class. This thesis proposes the Feature Aware Conditional Diffusion Oversampler, a diffusion based oversampling framework tailored for severely imbalanced financial tabular data. Feature Aware Conditional Diffusion Oversampler builds on the Denoising Diffusion Probabilistic Model paradigm and extends it with task-oriented mechanisms to improve minority sample quality. Specifically, Feature Aware Conditional Diffusion Oversampler conditions the denoising process using Shapley Additive Explanations derived feature-importance information via Feature Wise Linear Modulation, encouraging generation toward class-consistent regions of the minority manifold. Classifier free guidance further shapes the sampling trajectory, and a two-stage filtering strategy based on geometric proximity and probability-based consistency removes low-quality candidates and retains informative samples. Samples generated by Feature Aware Conditional Diffusion Oversampler are used to augment training data, and downstream LightGBM and XGBoost classification models are evaluated on two probability-of-default datasets and one credit card fraud dataset. Results show that proposed method consistently improves minority-sensitive metrics such as Recall, F1-score, G-Mean, and Area Under Precision Recall Curve compared to widely used oversampling baselines, supporting more reliable detection of financially critical minority events.
Subject Keywords
Imbalanced Classification, Diffusion Based Oversampling, Feature Importance Guided Generation, Credit Risk Modelling, Fraud Detection
URI
https://hdl.handle.net/11511/118495
Collections
Graduate School of Applied Mathematics, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
O. YAMAN, “FACDO: Feature-Aware Conditional Diffusion Oversampler for Robust Minority Oversampling in Imbalanced Credit Risk and Fraud Classification,” M.S. - Master of Science, Middle East Technical University, 2026.