ROLEX-SP: Rules of lexical syntactic patterns for free text categorization

2011-02-01
Al Zamil, Mohammed G. H.
Betin Can, Aysu
Due to the rapid growth of free text documents available in digital form, efficient techniques of automatic categorization are of great importance. In this paper, we present an efficient rule-based method for categorizing free text documents. The contributions of this research are the formation of lexical syntactic patterns as basic classification features, a categorization framework that addresses the problem of classifying free text with minimal label description, and an efficient learning algorithm in terms of time complexity and F-measure. The framework of ROLEX-SP concentrates on capturing the correct classes of text as well as reducing classification errors.
KNOWLEDGE-BASED SYSTEMS

Suggestions

Eye Tracking Scanpath Analysis Techniques on Web Pages: A Survey, Evaluation and Comparison
Eraslan, Sukru; Yesilada, Yeliz; Harper, Simon (2016-01-01)
Eye tracking has commonly been used to investigate how users interact with web pages, with the goal of improving their usability. This article comprehensively revisits the techniques that could be applicable to eye tracking data for analysing user scanpaths on web pages. It also uses a third-party eye tracking study to compare these techniques. This allows researchers to recognise existing techniques for their goals, understand how they work and know their strengths and limitations so that they can make an ...
CONTENT BASED HYPERSPECTRAL IMAGE RETRIEVAL USING BAG OF ENDMEMBERS IMAGE DESCRIPTORS
Omruuzun, Fatih; Demir, Begum; Bruzzone, Lorenzo; Çetin, Yasemin (2016-08-24)
This paper proposes a novel system for fast and accurate content based retrieval of hyperspectral images. The proposed system aims at retrieving hyperspectral images that have both similar spectral characteristics associated with specific materials and fractional abundances to the query image. It consists of two modules. The first module characterizes the query and the target hyperspectral images in the archive by two descriptors: 1) a binary spectral descriptor representing spectral characteristics of dist...
Second Chance: A Hybrid Approach for Dynamic Result Caching in Search Engines
Altıngövde, İsmail Sengör; Barla Cambazoglu, B.; Ulusoy, Ozgur (2011-01-01)
Result caches are vital for efficiency of search engines. In this work, we propose a novel caching strategy in which a dynamic result cache is split into two layers: an HTML cache and a docID cache. The HTML cache in the first layer stores the result pages computed for queries. The docID cache in the second layer stores ids of documents in search results. Experiments under various scenarios show that, in terms of average query processing time, this hybrid caching approach outperforms the traditional approac...
Crossing: a framework to develop knowledge-based recommenders in cross domains
Azak, Mustafa; Birtürk, Ayşe Nur; Department of Computer Engineering (2010)
Over the last decade, excess amount of information is being provided on the web and information filtering systems such as recommender systems have become one of the most important technologies to overcome the „Information Overload‟ problem by providing personalized services to users. Several researches have been made to improve quality of recommendations and provide maximum user satisfaction within a single domain based on the domain specific knowledge. However, the current infrastructures of the recommende...
WaPUPS: Web access pattern extraction under user-defined pattern scoring
Alkan, Oznur Kirmemis; Karagöz, Pınar (2016-04-01)
Extracting patterns from web usage data helps to facilitate better web personalization and web structure readjustment. The classical frequency-based sequence mining techniques consider only the binary occurrences of web pages in sessions that result in the extraction of many patterns that are not informative for users. To handle this problem, utility-based mining technique has emerged, which assigns non-binary values, called utilities, to web pages and calculates pattern utilities accordingly. However, the ...
Citation Formats
M. G. H. Al Zamil and A. Betin Can, “ROLEX-SP: Rules of lexical syntactic patterns for free text categorization,” KNOWLEDGE-BASED SYSTEMS, pp. 58–65, 2011, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/30356.