Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Early Detection of Fake News on Emerging Topics Through Weak Supervision
Download
index.pdf
Date
2023-9
Author
Akdağ, Serhat Hakkı
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
103
views
111
downloads
Cite This
In this thesis, we present a novel solution to the early detection of fake news problem on emerging topics through weak supervision. Traditional techniques rely on fact-checkers or supervised learning with labeled data, which is not readily available for emerging topics. To address this, we introduce end-to-end Weakly Supervised Text Classification framework, WeSTeC, to programmatically label a large-scale text dataset of a particular domain and train supervised text classifiers with the assigned labels. The proposed framework combines multiple weak labeling strategies and aggregates the generated weak labels into a single weak label per data instance. The generated labels are then used to fine tune a pre-trained RoBERTa classifier for fake news detection. By using the weakly labeled dataset containing fake news related to the emerging topic, the trained fake news detection model becomes specialized for the topic at hand. We consider both semi-supervision and domain adaptation setups, utilizing small amounts of labeled data and labeled data from other domains respectively. The proposed model is evaluated on both the quality of aggregated weak labels generated and the fake news detection classifier. In both evaluations, the model outperforms all baselines in each setup considered. In addition, when compared to the fully supervised counterpart, the fake news detection model trained on weak labels achieves an accuracy as close as 0.1\%, showing the effectiveness of the weak labeling module of the proposed framework.
Subject Keywords
Fake news detection
,
Weakly supervised learning
,
Text classification
,
Language models
URI
https://hdl.handle.net/11511/105390
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
S. H. Akdağ, “Early Detection of Fake News on Emerging Topics Through Weak Supervision,” M.S. - Master of Science, Middle East Technical University, 2023.