Large-scale cluster-based retrieval experiments on Turkish texts

Download
2007-11-30
Altıngövde, İsmail Sengör
Ocalan, Huseyin Cagdas
Can, Fazli
Ulusoy, Özgür
We present cluster-based retrieval (CBR) experiments on the largest available Turkish document collection. Our experiments evaluate retrieval effectiveness and efficiency on both an automatically generated clustering structure and a manual classification of documents. In particular, we compare CBR effectiveness with full-text search (FS) and evaluate several implementation alternatives for CBR. Our findings reveal that CBR yields comparable effectiveness figures with FS. Furthermore, by using a specifically tailored cluster-skipping inverted index we significantly improve in-memory query processing efficiency of CBR in comparison to other traditional CBR techniques and even FS.

Suggestions

Incremental cluster-based retrieval using compressed cluster-skipping inverted files
Altıngövde, İsmail Sengör; Can, Fazli; Ulusoy, Oezguer (2008-01-01)
We propose a unique cluster-based retrieval (CBR) strategy using a new cluster-skipping inverted file for improving query processing efficiency. The new inverted file incorporates cluster membership and centroid information along with the usual document information into a single structure. In our incremental-CBR strategy, during query evaluation, both best(-matching) clusters and the best(-matching) documents of such clusters are computed together with a single posting-list access per query term. As we swit...
Fast pyrolysis of Turkish hazelnut shell by using novel wire mesh reactor
Kazanç Özerinç, Feyza; Gürel, Kaan (null; 2018-04-27)
The present paper studies fast pyrolysis of Turkish hazelnut shell under various conditions in a novel wire mesh reactor (WMR). Particular emphasis was given to understand volatile yield at high heating rates at elevated temperatures. Volatile yields from fast pyrolysis (~3000 ºC/s) showed higher values from both as received (80 wt.%) and dried fuels (85 wt.%) than proximate analysis (PA) (75 wt. %) done at low heating rates (20 ºC/min). Brunauer–Emmitt–Teller (BET) surface analysis was carried out to deter...
An SGML based viewer for form documents
Atalay, Mehmet Volkan (1999-01-01)
© 1999 IEEE.Proposes a viewer for visual reconstruction and retrieval of form documents. The structure of the form document and the filled-in data are stored in an SGML (Standard Generalised Markup Language) instance. A DSSSL (Document Style Semantics and Specification Language) instance holds style properties of the data. Filled-in data is kept in a relational database. The viewer uses SGML and DSSSL instances as input and visually reconstructs a form document image by retrieving the corresponding filled-i...
A quantitative framework for testing the resilience of Islamic finance portfolios under IFSB and Basel capital rules
Aydin, Nadi Serhan (2017-01-01)
Purpose - This paper aims to introduce a model-based stress-testing methodology for Islamic finance products. The importance of stress testing was indeed clearly underlined by the adverse developments in the global finance industry. One of the key takeaways was the need to strengthen the coverage of the capital framework. Cognisant of this fact, Basel III encapsulates provisions to enhance the financial sector's ability to withstand shocks arising from possible stress events, thereby reducing adverse spillo...
A study on the building techniques and materials in the late antique and byzantine fortifications in anatolia: Ancyra and Nicaea /
Yavuzatmaca, Mercan; Serin, Ufuk; Conservation of Cultural Heritage in Department of Architecture (2016)
This research aims to investigate building techniques and materials in the Late Antique and Byzantine fortifications of Anatolia through the selected case studies of Ancyra/Ankara and Nicaea/Iznik. The majority of Late Antique and Byzantine fortifications in Anatolia are distinguished by ashlar masonry, including quantities of spolia, with alternating courses of brick. The frequent appearance of brick, in combination with more-or-less regularly cut blocks or spolia, in the buildings and fortifications of An...
Citation Formats
İ. S. Altıngövde, H. C. Ocalan, F. Can, and Ö. Ulusoy, “Large-scale cluster-based retrieval experiments on Turkish texts,” 2007, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/34457.