Utility Based Resolution of Data Inconsistencies

2004-07-18
MOTRO, AMIHAI
ANOKHIN, PHILIPP
Acar, Aybar Can
A virtual database system is software that provides unified access to multiple information sources. If the sources are overlapping in their contents and independently maintained, then the likelihood of inconsistent answers is high. Solutions are often based on ranking (which sorts the different answers according to recurrence) and on fusion (which synthesizes a new value from the different alternatives according to a specific formula). In this paper we argue that both methods are flawed, and we offer alternative solutions that are based on knowledge about the performance of the source data; including features such as recentness, availability, accuracy and cost. These features are combined in a flexible utility function that expresses the overall value of a data item to the user. Utility allows us to (1) define meaningful ranking on the inconsistent set of answers, and offer the topranked answer as a preferred answer; (2) determine whether a fusion value is indeed better than the initial values, by calculating its utility and comparing it to the utilities of the initial values; and (3) discover the best fusion: the fusion formula that optimizes the utility. The advantages of such performance-based and utility-driven ranking and fusion are considerable.
IQIS '04: Proceedings of the 2004 international workshop on Information quality in information systems

Suggestions

Synthesis of Monitoring Rules with STL
Aydin, Sertac Kagan; Aydın Göl, Ebru (World Scientific Pub Co Pte Lt, 2020-09-01)
Online monitoring is essential to enhance the reliability for various systems including cyber-physical systems and Web services. During online monitoring, the system traces are checked against monitoring rules in real time to detect deviations from normal behaviors. In general, the rules are defined as boundary conditions by the experts of the monitored system. This work studies the problem of synthesizing online monitoring rules in the form of temporal logic formulas in an automated way. The monitoring rul...
Retrospective adaptive prefetching for interactive Web GIS applications
Yesilmurat, Serdar; İşler, Veysi (2012-07-01)
A major task of a Web GIS (Geographic Information Systems) system is to transfer map data to client applications over the Internet, which may be too costly. To improve this inefficient process, various solutions are available. Caching the responses of the requests on the client side is the most commonly implemented solution. However, this method may not be adequate by itself. Besides caching the responses, predicting the next possible requests from a client and updating the cache with responses for those re...
Flexible Content Extraction and Querying for Videos
Demir, Utku; KOYUNCU, Murat; Yazıcı, Adnan; Yilmaz, Turgay; SERT, MUSTAFA (2011-10-28)
In this study, a multimedia database system which includes a semantic content extractor, a high-dimensional index structure and an intelligent fuzzy object-oriented database component is proposed. The proposed system is realized by following a component-oriented approach. It supports different flexible query capabilities for the requirements of video users, which is the main focus of this paper. The query performance of the system (including automatic semantic content extraction) is tested and analyzed in t...
Clustering scientific literature using sparse citation graph analysis
Bolelli, Levent; Ertekin Bolelli, Şeyda; Giles, C. Lee (2006-01-01)
It is well known that connectivity analysis of linked documents provides significant information about the structure of the document space for unsupervised learning tasks. However, the ability to identify distinct clusters of documents based on link graph analysis is proportional to the density of the graph and depends on the availability of the linking and/or linked documents in the collection. In this paper, we present an information theoretic approach towards measuring the significance of individual word...
SWARM-based data delivery framework in the Ad Hoc Internet of Things
Hasan, Mohammed Zaki; Al-Turjman, Fadi (2017-12-08)
Internet of Things (IoTs) refers to the rapidly growing network of connected objects that are able to collect and exchange data using embedded sensors. To guarantee the connectivity among these objects and devices, fault tolerant routing has been received a significant attention in recent years. In this paper, we propose a bio-inspired particle multi-swarm optimization (PMSO) routing algorithm to construct, recover and select k-disjoint paths that tolerates the failure while satisfying quality of service (Q...
Citation Formats
A. MOTRO, P. ANOKHIN, and A. C. Acar, “Utility Based Resolution of Data Inconsistencies,” presented at the IQIS ’04: Proceedings of the 2004 international workshop on Information quality in information systems, 2004, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/31375.