Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Lokman : a medical ontology based topical web crawler
Download
index.pdf
Date
2005
Author
Kyışoğlu, Altuğ
Metadata
Show full item record
Item Usage Stats
229
views
784
downloads
Cite This
Use of ontology is an approach to overcome the أsearch-on-the-netؤ problem. An ontology based web information retrieval system requires a topical web crawler to construct a high quality document collection. This thesis focuses on implementing a topical web crawler with medical domain ontology in order to find out the advantages of ontological information in web crawling. Crawler is implemented with Best-First search algorithm. Design of the crawler is optimized to UMLS ontology. Crawler is tested with Harvest Rate and Target Recall Metrics and compared to a non-ontology based Best-First Crawler. Performed test results proved that ontology use in crawler URL selection algorithm improved the crawler performance by 76%.
Subject Keywords
Computer software.
URI
http://etd.lib.metu.edu.tr/upload/2/12606468/index.pdf
https://hdl.handle.net/11511/15458
Collections
Graduate School of Informatics, Thesis
Suggestions
OpenMETU
Core
Ant Colony Optimization based clustering methodology
İNKAYA, TÜLİN; Kayaligil, Sinan; Özdemirel, Nur Evin (Elsevier BV, 2015-03-01)
In this work we consider spatial clustering problem with no a priori information. The number of clusters is unknown, and clusters may have arbitrary shapes and density differences. The proposed clustering methodology addresses several challenges of the clustering problem including solution evaluation, neighborhood construction, and data set reduction. In this context, we first introduce two objective functions, namely adjusted compactness and relative separation. Each objective function evaluates the cluste...
A new reactive method for processing web usage data
Bayır, Murat Ali; Toroslu, İsmail Hakkı; Department of Computer Engineering (2006)
In this thesis, a new reactive session reconstruction method 'Smart-SRA' is introduced. Web usage mining is a type of web mining, which exploits data mining techniques to discover valuable information from navigations of Web users. As in classical data mining, data processing and pattern discovery are the main issues in web usage mining. The first phase of the web usage mining is the data processing phase including session reconstruction. Session reconstruction is the most important task of web usage mining...
Systematic component-oriented development with axiomatic design
Toğay, Cengiz; Doğru, Ali Hikmet; Department of Computer Engineering (2008)
In this research, component oriented development is supported with design guidance by extending the Axiomatic Design Theory for component orientation, and utilizing domain engineering and ontology mechanisms. Guidance is offered in the form of suggesting missing components and discovering incompatibilities among the candidate elements of software development, corresponding to different phases such as requirement analysis, design, and implementation. A mature domain concept is developed suggesting the availa...
A systematic study of probabilistic aggregation strategies in swarm robotic systems
Soysal, Onur; Şahin, Erol; Department of Computer Engineering (2005)
In this study, a systematic analysis of probabilistic aggregation strategies in swarm robotic systems is presented. A generic aggregation behavior is proposed as a combination of four basic behaviors: obstacle avoidance, approach, repel, and wait. The latter three basic behaviors are combined using a three-state finite state machine with two probabilistic transitions among them. Two different metrics were used to compare performance of strategies. Through systematic experiments, how the aggregation performa...
Improving search result clustering by integrating semantic information from Wikipedia
Çallı, Çağatay; Üçoluk, Göktürk; Şehitoğlu, Onur Tolga; Department of Computer Engineering (2010)
Suffix Tree Clustering (STC) is a search result clustering (SRC) algorithm focused on generating overlapping clusters with meaningful labels in linear time. It showed the feasibility of SRC but in time, subsequent studies introduced description-first algorithms that generate better labels and achieve higher precision. Still, STC remained as the fastest SRC algorithm and there appeared studies concerned with different problems of STC. In this thesis, semantic relations between cluster labels and documents ar...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Kyışoğlu, “Lokman : a medical ontology based topical web crawler,” M.S. - Master of Science, Middle East Technical University, 2005.