A framework for information quality and coverage assessment for type 2 diabetes websites

Date

2020-10-22

Author

Ölçer, Didem

Health information seekers often use search engines to access high-quality and up-to-date information. However, finding high-quality health information online is becoming increasingly difficult because of the high volume of information generated by non-experts. Manual tools exist that help end-users assess the quality of websites, but they are labour-intensive. This thesis proposes a framework that automatically evaluates the content coverage and quality of health websites according to evidence-based medicine. The thesis has two main contributions. The first is a method that uses quality indicators derived from professional health literacy guidelines to measure information quality. The second is a method that uses both textual and content-based features, together with Okapi BM25 and MeSH term expansion, to assess information coverage and information quality. Content-based features were acquired from the American Diabetes Association's (ADA) guideline, an evidence-based practice guideline in diabetes. Specifically, sentences containing auxiliary verbs were extracted from the ADA guideline, and the weirdness coefficients of the terms, 2-grams and 3-grams generated from these sentences were calculated using the iWeb corpus as the reference corpus. The results showed that combining textual and content-based features is effective in classifying high- and low-quality websites. In addition, the features derived from professional health guidelines had a significant positive impact on the classification results.
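
The two scoring ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation. It assumes the usual definition of the weirdness coefficient (relative frequency of a term in a specialist corpus divided by its relative frequency in a general corpus) and a common Okapi BM25 variant with a non-negative IDF. The function names, the additive smoothing, the parameter defaults (k1 = 1.5, b = 0.75) and the whitespace-tokenized inputs are all assumptions made for illustration; how the thesis combines these scores with MeSH term expansion is not specified here.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def weirdness(term, special_counts, special_total,
              general_counts, general_total, smoothing=1.0):
    """Relative frequency in the specialist corpus divided by the
    relative frequency in the general corpus.

    `smoothing` is an assumption added here so that terms missing from
    the general corpus do not cause a division by zero.
    """
    special_rel = special_counts.get(term, 0) / special_total
    general_rel = ((general_counts.get(term, 0) + smoothing)
                   / (general_total + smoothing))
    return special_rel / general_rel


def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25.

    Uses the IDF form log((N - n + 0.5) / (n + 0.5) + 1), one of several
    variants in use; k1 and b are conventional defaults, not values
    taken from the thesis.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for q in query_terms:
            if tf[q] == 0:
                continue
            idf = math.log((n_docs - doc_freq[q] + 0.5)
                           / (doc_freq[q] + 0.5) + 1.0)
            length_norm = k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[q] * (k1 + 1) / (tf[q] + length_norm)
        scores.append(score)
    return scores
```

In this sketch, a guideline-derived term or n-gram with a high weirdness value is one that is much more frequent in the guideline sentences than in general English, which makes it a candidate content-based feature; BM25 then scores how strongly a web page matches a set of such terms.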

Subject Keywords

DISCERN, Website quality, Quality features, Information retrieval

URI

https://hdl.handle.net/11511/69100

Collections

Graduate School of Informatics, Thesis

Suggestions


A framework for automatic information quality ranking of diabetes websites Saglam, Rahime Belen; Taşkaya Temizel, Tuğba (2015-01-01) Objective: When searching for particular medical information on the internet the challenge lies in distinguishing the websites that are relevant to the topic, and contain accurate information. In this article, we propose a framework that automatically identifies and ranks diabetes websites according to their relevance and information quality based on the website content.
Quality assessment of web-based information on type 2 diabetes Olcer, Didem; Taşkaya Temizel, Tuğba (2021-10-01) Purpose This paper proposes a framework that automatically assesses content coverage and information quality of health websites for end-users. Design/methodology/approach The study investigates the impact of textual and content-based features in predicting the quality of health-related texts. Content-based features were acquired using an evidence-based practice guideline in diabetes. A set of textual features inspired by professional health literacy guidelines and the features commonly used for assessing in...
Quality oriented information retrieval and timeliness analysis on diabetes websites / Belen Sağlam, Rahime; Taşkaya Temizel, Tuğba; Department of Information Systems (2014) The foremost requirement of health information seekers is to retrieve high quality and up-to date information from web search engine results. Current techniques rely heavily on Web graph structure and they are domain independent solutions. However, in health domain, to ensure information quality, a search engine should return results that are not only relevant to submitted query but also in accordance with evidence based medical guidelines. The aim of this thesis is to propose an automated framework which i...
An Approach for quality control chart appropriateness evaluation based on desirability functions Tunç, Sıdıka; Köksal, Gülser; Department of Industrial Engineering (2016) Quality control charts are among the oldest and most powerful tools in statistical process control. Several control charts have been developed for specific needs and characteristics of processes. However, their proper implementation requires expert knowledge about statistics and properties of these charts. In this study, an effective approach is developed to evaluate appropriateness of control charts for the process to be monitored and the process owner. This approach can be used to recommend the most appro...
Characterizing web search queries that match very few or no results Altıngövde, İsmail Sengör; Cambazoglu, Berkant Barla; Ozcan, Rifat; Sarigil, Erdem; Ulusoy, Özgür (2012-12-19) Despite the continuous efforts to improve the web search quality, a non-negligible fraction of user queries end up with very few or even no matching results in leading web search engines. In this work, we provide a detailed characterization of such queries based on an analysis of a real-life query log. Our experimental setup allows us to characterize the queries with few/no results and compare the mechanisms employed by the major search engines in handling them.

Citation Formats

D. Ölçer, “A framework for information quality and coverage assessment for type 2 diabetes websites,” Ph.D. - Doctoral Program, Middle East Technical University, 2020.