CREDIT CARD FRAUD DETECTION WITH AUTOENCODERS, ONE-CLASS SVMS AND ISOLATION FORESTS

2023-12-6
Özkum, Özdemir
Fraud Detection has gained importance in the last few years due to its cost to the economy and the population in general. However, there are a number of important problems in the modeling process to tackle when detecting fraud. One is the availability of data. Since the financial data produced by the customers is subject to privacy rules, working with synthetic data is a necessity. While this is the case, there are very few data sets that reflect the true nature of the data generation and analysis processes of financial transactions. Another problem is that the fraud or any data that require anomaly detection do not have labeled data. Even labeled, the data points are labeled by rules and there is a need to automate this process. This is why it is deemed important to incorporate an unsupervised collection of methods and let the data speak for itself. For these reasons, this study focuses on the unsupervised learning methods in fraud detection using a synthetic data set which consists of credit card transactions. The learning methods used were the Autoencoders, Sparse Autoencoders, One-Class Support Vector Machines and Isolation Forests. For comparison, a Random Forest Model was also built. It was found that among the unsupervised methods One-Class SVM was the best performing model with Precision=0.68, Recall=0.98, and F1 Score=0.81. However, One-Class SVM did not outperform the supervised Random Forest model which achieved Precision=1.00, Recall=0.95, and F1-Score=0.97.
Citation Formats
Ö. Özkum, “CREDIT CARD FRAUD DETECTION WITH AUTOENCODERS, ONE-CLASS SVMS AND ISOLATION FORESTS,” M.S. - Master of Science, Middle East Technical University, 2023.