An automatic approach to construct domain-specific web portals

Date

2007-12-01

Author

Altıngövde, İsmail Sengör
Cetintas, Suleyman
Yilmaz, Hakan
Ulusoy, Özgür

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

We describe the architecture of a system that automatically constructs domain-specific Web portals. The system has three major components: i) a focused crawler that collects domain-specific pages from the Web, ii) an information extraction engine that extracts useful fields from these pages, and iii) a query engine that supports both typical keyword-based queries on the pages and advanced queries on the extracted data fields. We present a prototype system for the domain of course homepages. A user study with the prototype shows that our approach produces high-quality results and achieves better precision than typical keyword-based search.
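The three components named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the in-memory `TOY_WEB`, the term-overlap `relevance` score, the regular expressions in `extract_fields`, and all function names are hypothetical stand-ins chosen only to show how a crawler, an extractor, and a query engine might fit together.

```python
import heapq
import re

# Hypothetical in-memory "Web": url -> (page text, outgoing links).
TOY_WEB = {
    "u/home": ("University home page with news", ["u/cs101", "u/sports"]),
    "u/cs101": ("Course CS101 syllabus. Instructor: Ada Smith. Homework and exams.", ["u/cs202"]),
    "u/cs202": ("Course CS202 syllabus. Instructor: Bob Jones. Lecture notes.", []),
    "u/sports": ("Football scores and match schedule", []),
}

# Assumed vocabulary for the course-homepage domain (illustrative only).
DOMAIN_TERMS = {"course", "syllabus", "instructor", "homework", "lecture"}


def relevance(text):
    """Toy relevance score: fraction of domain terms that occur in the page."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return len(words & DOMAIN_TERMS) / len(DOMAIN_TERMS)


def focused_crawl(web, seed, threshold=0.4):
    """(i) Best-first crawl: links found on more relevant pages are visited first."""
    frontier = [(-1.0, seed)]  # heapq is a min-heap, so priorities are negated
    seen, collected = {seed}, {}
    while frontier:
        _, url = heapq.heappop(frontier)
        text, links = web[url]
        score = relevance(text)
        if score >= threshold:
            collected[url] = text  # keep only pages judged on-topic
        for link in links:
            if link in web and link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score, link))
    return collected


def extract_fields(text):
    """(ii) Extract structured fields with regexes (a stand-in for a real extractor)."""
    code = re.search(r"\b([A-Z]{2,4}\d{3})\b", text)
    instructor = re.search(r"Instructor:\s*([A-Z][a-z]+ [A-Z][a-z]+)", text)
    return {
        "code": code.group(1) if code else None,
        "instructor": instructor.group(1) if instructor else None,
    }


def keyword_query(pages, term):
    """(iii-a) Plain keyword search over the collected page text."""
    return sorted(url for url, text in pages.items() if term.lower() in text.lower())


def field_query(records, field, value):
    """(iii-b) Advanced query over the extracted data fields."""
    return sorted(url for url, rec in records.items() if rec.get(field) == value)


pages = focused_crawl(TOY_WEB, "u/home")
records = {url: extract_fields(text) for url, text in pages.items()}
print(keyword_query(pages, "syllabus"))
print(field_query(records, "instructor", "Bob Jones"))
```

In this toy setting the keyword query matches every collected course page, while the field query isolates the single page whose extracted instructor matches, which is the kind of precision gain the abstract attributes to querying extracted fields.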

Subject Keywords

Focused Crawling, Information Extraction, Querying

URI

https://hdl.handle.net/11511/36162

DOI

https://doi.org/10.1145/1321440.1321558

Collections

Department of Computer Engineering, Conference / Seminar

Citation Formats

İ. S. Altıngövde, S. Cetintas, H. Yilmaz, and Ö. Ulusoy, “An automatic approach to construct domain-specific web portals,” 2007, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/36162.