Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Developing a text categorization template for Turkish news portals
Date
2011-08-11
Author
Toraman, Çağrı
Can, Fazli
Koçberber, Seyit
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
10
views
0
downloads
Cite This
In news portals, text category information is needed for news presentation. However, for many news stories the category information is unavailable, incorrectly assigned or too generic. This makes the text categorization a necessary tool for news portals. Automated text categorization (ATC) is a multifaceted difficult process that involves decisions regarding tuning of several parameters, term weighting, word stemming, word stopping, and feature selection. In this study we aim to find a categorization setup that will provide highly accurate results in ATC for Turkish news portals. We also examine some other aspects such as the effects of training dataset set size and robustness issues. Two Turkish test collections with different characteristics are created using Bilkent News Portal. Experiments are conducted with four classification methods: C4.5, KNN, Naive Bayes, and SVM (using polynomial and rbf kernels). Our results recommends a text categorization template for Turkish news portals and provides some future research pointers. © 2011 IEEE.
Subject Keywords
news portals
,
text categorization
,
Turkish news
URI
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=79961178944&origin=inward
https://hdl.handle.net/11511/109640
DOI
https://doi.org/10.1109/inista.2011.5946096
Conference Name
2011 International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2011
Collections
Department of Computer Engineering, Conference / Seminar
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
Ç. Toraman, F. Can, and S. Koçberber, “Developing a text categorization template for Turkish news portals,” presented at the 2011 International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2011, Istanbul-Kadikoy, Türkiye, 2011, Accessed: 00, 2024. [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=79961178944&origin=inward.