Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
A Machine Learning Framework for High-Frequency Midprice Prediction: Feature Engineering, Hyperparameter Space Exploration, and Profit-Loss Evaluation
Download
10782085.pdf
Date
2026-1
Author
Tiryaki, Barış
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
83
views
0
downloads
Cite This
This thesis studies short-horizon mid-price direction prediction in high-frequency trading using limit order book (LOB) data from Borsa Istanbul. A machine learning framework based on XGBoost is proposed to predict whether the mid-price will move up, stay the same, or move down over a short time horizon. A new feature called Effective Q-Price (EQP) is introduced in the scope of the thesis to better represent market liquidity. EQP combines price and quantity information across multiple depth levels and can be seen as a liquidity-aware extension of the standard mid-price. In addition, a regression-based classification approach is proposed, where a single continuous output is mapped to three trading signals. The models are trained using feature-rich representations extracted from the LOB, and the effects of key hyperparameters are analyzed in a systematic evaluation setup. The imbalanced class labels are addressed with class weighting and decision-threshold tuning. The predictive performance and the complexity of the model are evaluated. To this end, a further contribution of the thesis is a profit and loss (P&L) backtest that converts predictions into trading positions while accounting for transaction costs. The results show that EQP is consistently among the most informative features and allows feature reduction without a clear loss in predictive performance or P&L. The regression-based approach achieves performance similar to direct classification while using smaller and simpler models.
Subject Keywords
High-Frequency Trading
,
Mid-Price Prediction
,
Limit Order Book
,
XGBoost
,
Feature Engineering
URI
https://hdl.handle.net/11511/118390
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
B. Tiryaki, “A Machine Learning Framework for High-Frequency Midprice Prediction: Feature Engineering, Hyperparameter Space Exploration, and Profit-Loss Evaluation,” M.S. - Master of Science, Middle East Technical University, 2026.