Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Comparison of missing data imputation methods applied to daily temperature and precipitation data in Turkey
Download
Didem_Gezgen_Master_Thesis.pdf
Date
2023-8-07
Author
Gezgen, Didem
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
351
views
107
downloads
Cite This
A significant portion of the data under analysis contains missing values, which hinders the generation of meaningful results, particularly when dealing with time-dependent data where the order of observations is crucial. This issue leads to unreliable outcomes in statistical analyses applied in fields such as meteorology and economy. To address this challenge, handling missing values meticulously in time-dependent data is imperative. In this thesis, daily average temperature and total precipitation data, obtained from the General Directorate of Meteorology of Turkey, were utilized. The primary objective was to impute the missing values in these datasets using various methods and subsequently compare their performance. Missing values were intentionally introduced into the temperature and precipitation data. The methods employed for imputation included Simple Arithmetic Average Method (SAA), K-Nearest Neighbor Method (KNN), Random Forest Method (RF), Multiple Imputation by Chained Equation Method (MICE), and Generalized Adversarial Imputation Network (GAIN). The outcomes were assessed based on the Root Mean Square Error (RMSE), Coefficient of Variation of Root Mean Square Error (CVRMSE), and Nash-Sutcliffe Efficiency (NSE). The results indicated that Random Forests exhibited superior performance in most cases, followed by KNN and GAIN.
Subject Keywords
General Adversarial Imputation Network (GAIN)
,
Multiple Imputation by Chained Equation (MICE)
,
Nash-Sutcliffe Efficiency (NSE)
,
Meteorological data
,
Random Forest (RF)
URI
https://hdl.handle.net/11511/104882
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
D. Gezgen, “Comparison of missing data imputation methods applied to daily temperature and precipitation data in Turkey,” M.S. - Master of Science, Middle East Technical University, 2023.