Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales

2018-06-15
Arslan, Yusuf
Kucuk, Dilek
Birtürk, Ayşe Nur
Sentiment analysis is a popular research topic in social media analysis and natural language processing. In this paper, we present the details and evaluation results of our Twitter sentiment analysis experiments which are based on word embeddings vectors such as word2vec and doc2vec, using an ANN classifier. In these experiments, we utilized two publicly available sentiment analysis datasets and four smaller datasets derived from these datasets, in addition to a publicly available trained vector model over 400 million tweets. The evaluation results are accompanied with discussions and future research directions based on the current study. One of the main conclusions drawn from the experiments is that filtering out the emoticons in the tweets could be a facilitating factor for sentiment analysis on tweets.

Suggestions

Real-Time Lexicon-Based Sentiment Analysis Experiments On Twitter With A Mild (More Information, Less Data)
Arslan, Yusuf; Birtürk, Ayşe Nur; Djumabaev, Bekjan; Kucuk, Dilek (2017-12-14)
Sentiment analysis of Twitter data is a well studied area, however, there is a need for exploring the effectiveness of real-time approaches on small data sets that only include popular and targeted tweets. In this paper, we have employed several sentiment analysis techniques by using dynamic dictionaries and models, and performed some experiments on limited but relevant datasets to understand the popularity of some terms and the opinion of users about them. The results of our experiments are promising.
Clustering based personality prediction on turkish tweets
Tutaysalgir, Esen; Karagöz, Pınar; Toroslu, İsmail Hakkı (2019-08-30)
In this paper, we present a framework for predicting the personality traits by analyzing tweets written in Turkish. The prediction model is constructed with a clustering based approach. Since the model is based on linguistic features, it is language specific. The prediction model uses features applicable to Turkish language and related to writing style of Turkish Twitter users. Our approach uses anonymous BIGS questionnaire scores of volunteer participants as the ground truth in order to generate personalit...
Clustering based personality prediction on Turkish Tweets
Tutaysalgır, Esen; Toroslu, İsmail Hakkı; Department of Computer Engineering (2019)
In this thesis, we present a framework for predicting the personality traits of users using their tweets written in Turkish. The prediction model is constructed with a clustering based approach. We show how to extract linguistic features from tweet data and to adapt TF-IDF weighting and word embeddings to the Turkish tweets. Since the model is based on linguistic features, it is language specific. The prediction model uses features applicable to Turkish language and related to writing style of Turkish Twitt...
Investigating sentimental relation between social media presence and academic success of Turkish Universities
Gunduz, Sedef; Demirhan, Fatih; SAĞIROĞLU, Şeref (2014-12-06)
In this study an approach that uses social networking data for developing sentiment analysis system is proposed. With the help of developed software, it is tried to find out whether there is any relation between universities' academic success and sentiment of the public about them in social media. After collecting enough text based data from Twitter, preprocessing of data is carried out and final data is trained by means of Naive Bayes Classifier. After testing process, experimental results have shown that ...
Tweetology of Learning Analytics: What does Twitter tell us about the trends and development of the field?
Khalil, Mohammad; Wong, Jacqueline; Er, Erkan; Heitmann, Martin; Belokrys, Gleb (2022-03-21)
Twitter is a very popular microblogging platform that has been actively used by scientific communities to exchange scientific information and to promote scholarly discussions. The present study aimed to leverage the tweet data to provide valuable insights into the development of the learning analytics field since its initial days. Descriptive analysis, geocoding analysis, and topic modeling were performed on over 1.6 million tweets related to learning analytics posted between 2010-2021. The descriptive anal...
Citation Formats
Y. Arslan, D. Kucuk, and A. N. Birtürk, “Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales,” 2018, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/76489.