Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Event Extraction from Turkish Football Web-casting Texts Using Hand-crafted Templates
Date
2009-09-16
Author
Doruk, Tunaoğlu
Alan, Özgür
Orkunt, Sabuncu
Samet, Akpınar
Çiçekli, Fehime Nihan
Alpaslan, Ferda Nur
Metadata
Show full item record
Item Usage Stats
162
views
0
downloads
Cite This
In this paper, we present a domain specific information extraction approach. We use manually formed templates to extract information from unstructured documents where grammatical and syntactical errors occur frequently. We applied our approach to primarily Turkish unstructured soccer Web-casting texts. Compared to automated approaches we achieve high precision-recall rates (97% - 85%). In addition to that, unlike automated approaches we do not use part-of-speech taggers, parsers, phrase chunkers or that kind of a linguistic tool. As a result, our approach can be applied to any domain or any language without the necessity of successful linguistic tools. The drawback of our approach is the time spent on crafting the templates. We also propose the means to decrease that time.
Subject Keywords
Semantic Web
,
Data mining
,
Intelligent systems
,
Games
,
Internet
,
Cities and towns
,
Computer errors
,
Information analysis
,
Buildings
,
Search engines
,
Ontologies
,
Information retrieval
,
Sport
,
Text analysis
URI
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5298635
https://hdl.handle.net/11511/72205
Conference Name
IEEE International Conference on Semantic Computing (ICSC 2009) (14 - 16 Eylül 2009)
Collections
Department of Computer Engineering, Conference / Seminar
Suggestions
OpenMETU
Core
Event Extraction from Turkish Football Web-casting Texts Using Hand-crafted Templates
Tunaoglu, Doruk; Alan, Oezguer; Sabuncu, Orkunt; Akpinar, Samet; Cicekli, Nihan K.; Alpaslan, Ferda Nur (2009-09-16)
In this paper, we present a domain specific information extraction approach We use manually formed templates to extract information from unstructured documents where grammatical and syntactical errors occur frequently We applied our approach to primarily Turkish unstructured soccer web-casting texts Compared to automated approaches we achieve high precision-recall rates (97% - 85%). In addition to that, unlike automated approaches we do not use part-of-speech taggers, parsers, phrase chunkers or that kind o...
Text Classification in the Turkish Marketing Domain for Context Sensitive Ad Distribution
Engin, Melih; Can, Tolga (2009-09-16)
In this paper, we construct and compare several feature extraction approaches in order to find a better solution for classification of Turkish web documents in the marketing domain. We produce our feature extraction techniques using characteristics of the Turkish language, structures of web documents and online content in the marketing domain. We form datasets in different feature spaces and we apply several Support Vector Machine (SVM) configurations on these datasets. We conduct our study considering the ...
Process based information systems evaluation: Towards the attributes of "pRISE"
Özkan Yıldırım, Sevgi; Bilgen, Semih (2007-10-31)
Purpose The purpose of this paper is to demonstrate the importance of undertaking a systemic view of information systems evaluation that augments the frequently reported prescriptive (cost/benefit) analysis approaches. Design/methodology/approach The paper adopts a qualitative case perspective and derives a framework for substantive information systems evaluation factors (PRISE). Three empirical formulations are considered and a comparison made to determine the content and context of the findings. Finding...
A framework for ranking and categorizing medical documents
Al Zamıl, Mohammed GH. I.; Betin Can, Aysu; Department of Information Systems (2010)
In this dissertation, we present a framework to enhance the retrieval, ranking, and categorization of text documents in medical domain. The contributions of this study are the introduction of a similarity model to retrieve and rank medical textdocuments and the introduction of rule-based categorization method based on lexical syntactic patterns features. We formulate the similarity model by combining three features to model the relationship among document and construct a document network. We aim to rank ret...
Topic-centric querying of web information resources
Altıngövde, İsmail Sengör; Ulusoy, O; Ozsoyoglu, G; Ozsoyoglu, ZM (2001-01-01)
This paper deals with the problem of modeling web information resources using expert knowledge and personalized user information, and querying them in terms of topics and topic relationships. We propose a model for web information resources, and a query language SQL-TC (Topic-Centric SQL) to query the model. The model is composed of web-based information resources (XML or HTML documents on the web), expert advice repositories (domain-expert-specified metadata for information resources), and personalized inf...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
T. Doruk, Ö. Alan, S. Orkunt, A. Samet, F. N. Çiçekli, and F. N. Alpaslan, “Event Extraction from Turkish Football Web-casting Texts Using Hand-crafted Templates,” presented at the IEEE International Conference on Semantic Computing (ICSC 2009) (14 - 16 Eylül 2009), Berkeley, CA, USA, 2009, Accessed: 00, 2021. [Online]. Available: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5298635.