Predicting the size of candidate document set for implicit web search result diversification

2020-01-01
© Springer Nature Switzerland AG 2020.Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for representing documents to improve diversification, and show a positive result. As a second improvement, we propose to automatically predict the size of candidate set on per query basis. Experimental evaluations using our BM25 runs as well as the best-performing ad hoc runs submitted to TREC (2009–2012) show that our approach improves the performance of implicit diversification up to 5.4% wrt. initial ranking.

Suggestions

Bayesian Networks in Project Management
Yet, Barbaros (John Wiley & Sons, West Sussex, UK , 2017-08-01)
Bayesian networks (BNs) offer unique benefits for combining data and expert knowledge to model complex joint probability distributions. Recent advances in inference algorithms enabled efficient computation of BNs with both discrete and continuous variables that are also called hybrid BNs. Consequently, BNs have been widely used as risk assessment and decision support tools in various domains including project management. This article illustrates the use of BNs in different aspects of project m...
Data sharing under confidentiality
Başer, Erdem; Hülagu, Timur; Akyıldız, Ersan; Bilgen, Adnan; Cenk, Murat; Keskinkurt-paksoy, İrem; Kestel, Sevtap Ayşe (2018-08-31)
Central Bank of the Republic of Turkey presents an approach to address the data sharing dilemma of maximizing the benefit for academic research while ensuring compliance with applicable data confidentiality legislations. The work in this paper compares the performance of different perturbation methods. Empirical estimates are presented over a wide range of statistical methods. The results in the paper are expected to be used to inform the design of access procedures to confidential microdata in central banks.
Bayesian Networks in Project Management
Yet, Barbaros (2017-01-01)
Bayesian networks (BNs) offer unique benefits for combining data and expert knowledge to model complex joint probability distributions. Recent advances in inference algorithms enabled efficient computation of BNs with both discrete and continuous variables that are also called hybrid BNs. Consequently, BNs have been widely used as risk assessment and decision support tools in various domains including project management. This article illustrates the use of BNs in different aspects of project management and ...
Prediction of Protein-Protein Interaction Relevance of Articles Using References
Calli, Cagatay (2009-09-16)
Classifying documents as protein-protein interaction (PPI) relevant or not is the first step towards extracting meaningful PPI data from article content. Currently, this classification step is handled manually by expert curators. A number of text-mining methods have been proposed to tackle this problem, using abstracts without references. We propose that article references contain important information that can be used to enhance these previous techniques. We trained an SVM classifier solely based on refere...
Evaluating eReverse auctions (EeRA): A case research note
Hackney, Ray; Loesch, Andrea; Irani, Zahir; Ghoneim, Ahmad; Özkan Yıldırım, Sevgi (2007-03-01)
Purpose – To evaluate issues relating to the implementation of electronic reverse auctions (eRA) within local government procurement processes. Design/methodology/approach – The methodology is a structured case analysis approach to enable qualitative data to be modelled through a visual toolset simulation. Findings – The paper identifies a set of business scenarios to demonstrate the impact of different eRA strategies in this respect. Practical implications – The case research described in this paper propos...
Citation Formats
Y. B. Ulu and İ. S. Altıngövde, “Predicting the size of candidate document set for implicit web search result diversification,” 2020, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/58018.