Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter

Ozdikis, Ozer
Karagöz, Pınar
Oğuztüzün, Mehmet Halit S.
This paper aims to enhance event detection methods in a micro-blogging platform, namely Twitter. The enhancement technique we propose is based on lexico-semantic expansion of tweet contents while applying document similarity and clustering algorithms. Considering the length limitations and idiosyncratic spelling in Twitter environment, it is possible to take advantage of word similarities and to enrich texts with similar words. The semantic expansion technique we implement is based on syntagmatic and paradigmatic relationships between words, extracted from their co-occurrence statistics. As our technique does not depend on an existing ontology or a lexicon database such as WordNet, it should be applicable for any language. The proposed technique is applied on a tweet set collected for three days from the users in Turkey. The results indicate earlier detection of events and improvements in accuracy.


Semantic Expansion of Hashtags for Enhanced Event Detection in Twitter
Özdikiş, Özer; Karagöz, Pınar; Oğuztüzün, Mehmet Halit Seyfullah (2012-09-09)
In this work, we present an event detection method in Twitter based on clustering of hashtags and introduce an enhancement technique by using the semantic similarities between the hashtags. To this aim, we devised two methods for tweet vector generation and evaluated their effect on clustering and event detection performance in comparison to word-based vector generation methods. By analyzing the contexts of hashtags and their co-occurrence statistics with other words, we identify their paradigmatic relation...
Word Embedding Based Event Detection on Social Media
Ertugrul, Ali Mert; Velioglu, Burak; Karagöz, Pınar (2017-06-23)
Event detection from social media messages is conventionally based on clustering the message contents. The most basic approach is representing messages in terms of term vectors that are constructed through traditional natural language processing (NLP) methods and then assigning weights to terms generally based on frequency. In this study, we use neural feature extraction approach and explore the performance of event detection under the use of word embeddings. Using a corpus of a set of tweets, message terms...
Event detection on social media using transaction based stream processing engine
Çınar, Hüseyin Alper; Karagöz, Pınar; Department of Computer Engineering (2019)
The aim of this study is detecting events on social media by improving current solutions in terms of accuracy and time performance. An event is something that occurs in a short duration of time in a certain place. In this thesis, the problem is modelled as a streaming transaction process. Three different event detection method is adapted to our solution. First one is the keyword-based event detection method that looks for bursty keywords in a period. The second one is the clustering-based event detection me...
Clustering based personality prediction on turkish tweets
Tutaysalgir, Esen; Karagöz, Pınar; Toroslu, İsmail Hakkı (2019-08-30)
In this paper, we present a framework for predicting the personality traits by analyzing tweets written in Turkish. The prediction model is constructed with a clustering based approach. Since the model is based on linguistic features, it is language specific. The prediction model uses features applicable to Turkish language and related to writing style of Turkish Twitter users. Our approach uses anonymous BIGS questionnaire scores of volunteer participants as the ground truth in order to generate personalit...
Resource management in cellular communication networks with subscriber profile prediction
Yilmaz, N; Ergul, R (2004-09-09)
in this paper, a dynamic resource management and subscriber profile prediction scheme for mobile communication systems is presented. Using this approach, a high level of QoS for both handoff calls and new calls, and better utilization of wireless network resources has been achieved. The simultaneous satisfaction of these two actually conflicting interests will be due to two key features. First, it is due to the subscriber profile predicted using subscriber data. This apriori information about possible hando...
Citation Formats
O. Ozdikis, P. Karagöz, and M. H. S. Oğuztüzün, “Semantic Expansion of Tweet Contents for Enhanced Event Detection in Twitter,” 2012, Accessed: 00, 2020. [Online]. Available: