Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Named entity recognition and explainability analysis on Turkish sports news texts
Download
10601332.pdf
Date
2023-12
Author
Kılıç, Yüksel Pelin
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
148
views
90
downloads
Cite This
In Natural Language Processing (NLP) and Information Extraction, Named Entity Recognition (NER) is a significant challenge. NER involves identifying entities like Person, Location, and Organization from text. While NER is well-researched in English and Chinese, Turkish NER lags, especially in domain-specific areas like sports. The sports industry has seen a remarkable transformation with the convergence of sports and technology, impacting performance enhancement, fan engagement, and management. There is an untapped potential in extracting qualitative insights from textual data, offering a deeper understanding of the dynamics between athletes, teams, and supporters. One key area needing further exploration is applying deep learning techniques to Turkish NER, particularly in comparison with traditional methods. Additionally, there is a lack of research on the interpretability and explainability of transformer-based models in this context. This study introduces domain-specific Turkish NER data sets, mainly those relevant to sports, to evaluate the effectiveness of transformer-based models in Turkish NER. A significant aspect of this research is comparing these models and analyzing how different annotation formats impact the results. The effects of named entity distribution on model performance are investigated through cross-validation techniques. Another crucial component of this study is focusing on interpretability. By employing interpretability methods, we aim to uncover the rationale and mechanisms behind the model predictions. This aspect is crucial in understanding how these models function and make decisions, a relatively under-explored area in Turkish NER. This research contributes to NLP and Information Extraction and has implications for enriching sports research and management, providing new insights into the interaction between sports and technology.
Subject Keywords
Named entity recognition
,
Sports domain
,
Google BERT
,
LIME
,
SHAP
,
Sequence tagging
,
Token classification
URI
https://hdl.handle.net/11511/107643
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
Y. P. Kılıç, “Named entity recognition and explainability analysis on Turkish sports news texts,” M.S. - Master of Science, Middle East Technical University, 2023.