Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Utilizing Word Embeddings for Result Diversification in Tweet Search
Date
2015-12-04
Author
Onal, Kezban Dilek
Altıngövde, İsmail Sengör
Karagöz, Pınar
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
204
views
0
downloads
Cite This
The performance of result diversification for tweet search suffers from the well-known vocabulary mismatch problem, as tweets are too short and usually informal. As a remedy, we propose to adopt a query and tweet expansion strategy that utilizes automatically-generated word embeddings. Our experiments using state-of-the-art diversification methods on the Tweets2013 corpus reveal encouraging results for expanding queries and/or tweets based on the word embeddings to improve the diversification performance in tweet search. We further show that the expansions based on the word embeddings may serve as useful as those based on a manually constructed knowledge base, namely, ConceptNet.
Subject Keywords
Diversification
,
Expansion
,
Tweet search
,
Word embedding
URI
https://hdl.handle.net/11511/38301
DOI
https://doi.org/10.1007/978-3-319-28940-3_29
Collections
Department of Computer Engineering, Conference / Seminar
Suggestions
OpenMETU
Core
Semantic Expansion of Hashtags for Enhanced Event Detection in Twitter
Özdikiş, Özer; Karagöz, Pınar; Oğuztüzün, Mehmet Halit Seyfullah (2012-09-09)
In this work, we present an event detection method in Twitter based on clustering of hashtags and introduce an enhancement technique by using the semantic similarities between the hashtags. To this aim, we devised two methods for tweet vector generation and evaluated their effect on clustering and event detection performance in comparison to word-based vector generation methods. By analyzing the contexts of hashtags and their co-occurrence statistics with other words, we identify their paradigmatic relation...
Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales
Arslan, Yusuf; Kucuk, Dilek; Birtürk, Ayşe Nur (2018-06-15)
Sentiment analysis is a popular research topic in social media analysis and natural language processing. In this paper, we present the details and evaluation results of our Twitter sentiment analysis experiments which are based on word embeddings vectors such as word2vec and doc2vec, using an ANN classifier. In these experiments, we utilized two publicly available sentiment analysis datasets and four smaller datasets derived from these datasets, in addition to a publicly available trained vector model over ...
User Interest Modeling in Twitter with Named Entity Recognition
Karatay, Deniz; Karagöz, Pınar (null; 2015-05-18)
Considering wide use of Twitter as the source of information, reaching an interesting tweet for a user among a bunch of tweets is challenging. In this work we propose a Named Entity Recognition (NER) based user profile modeling for Twitter users and employ this model to generate personalized tweet recommendations. Effectiveness of the proposed method is shown through a set of experiments. Copyright © 2015 held by author(s).
Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter
Ozdikis, Ozer; Karagöz, Pınar; Oğuztüzün, Mehmet Halit S. (2012-08-29)
This paper aims to enhance event detection methods in a micro-blogging platform, namely Twitter. The enhancement technique we propose is based on lexico-semantic expansion of tweet contents while applying document similarity and clustering algorithms. Considering the length limitations and idiosyncratic spelling in Twitter environment, it is possible to take advantage of word similarities and to enrich texts with similar words. The semantic expansion technique we implement is based on syntagmatic and paradi...
Result Diversification for Tweet Search
Ozsoy, Makbule Gulcin; Onal, Kezban Dilek; Altıngövde, İsmail Sengör (2014-10-14)
Being one of the most popular microblogging platforms, Twitter handles more than two billion queries per day. Given the users' desire for fresh and novel content but their reluctance to submit long and descriptive queries, there is an inevitable need for generating diversified search results to cover different aspects of a query topic. In this paper, we address diversification of results in tweet search by adopting several methods from the text summarization and web search domains. We provide an exhaustive ...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
K. D. Onal, İ. S. Altıngövde, and P. Karagöz, “Utilizing Word Embeddings for Result Diversification in Tweet Search,” 2015, vol. 9460, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/38301.