Named entity recognition from scratch on social media

2015-09-07
Önal, Kezban Dilek
Karagöz, Pınar
With the extensive amount of textual data flowing through social media platforms, the interest in Information Extraction (IE) on such textual data has increased. Named Entity Recognition (NER) is one of the basic problems of IE. State-of-the-art solutions for NER face an adaptation problem to informal texts from social media platforms. In this study, we addressed this generalization problem with the NLP from scratch idea that has been shown to be successful for several NLP tasks on formal text. Experimental results have shown that word embeddings can be successfully used for NER on informal text.
International Workshop on Mining Ubiquitous and Social Environments (MUSE 2015) ECML-PKDD, MUSE Workshop, (07 Eylül 2015)

Suggestions

Detecting User Emotions in Twitter through Collective Classification
İLERİ, İBRAHİM; Karagöz, Pınar (2016-11-11)
The explosion in the use of social networks has generated a big amount of data including user opinions about varying subjects. For classifying the sentiment of user postings, many text-based techniques have been proposed in the literature. As a continuation of sentiment analysis, there are also studies on the emotion analysis. Due to the fact that many different emotions are needed to be dealt with at this point, the problem gets more complicated as the number of emotions to be detected increases. In this s...
Identifying textual personal information using bidirectional LSTM networks
Ertekin Bolelli, Şeyda (2018-07-09)
Data-driven approaches based on the data collected from individuals are improving everyday life as a result of the developments in big data studies. Prior to developing such an approach, removal of personal information from the data is important since personal information contained in data would jeopardize people's privacy and may harm related individuals. Especially in the field of health sciences, identifying personal information in the collected data is a difficult task as most of the data collected in h...
A Context Aware Notification Architecture Based on Distributed Focused Crawling in the Big Data Era
AKYOL, MEHMET ALİ; Gökalp, Mert Onuralp; Kayabay, Kerem; Eren, Pekin Erhan; Koçyiğit, Altan (2017-09-08)
The amount of data created in various sources over the Web is tremendously increasing. Trying to keep track of relevant sources is an increasingly time-consuming task. The traditional way of accessing information over the Web is pull-based. Users need to query data sources in certain time intervals where an important piece of information can be lately recognized or even missed completely. Technologies including RSS help users to get push-based notifications from websites. Discovering the relevant informatio...
Real-Time Lexicon-Based Sentiment Analysis Experiments On Twitter With A Mild (More Information, Less Data)
Arslan, Yusuf; Birtürk, Ayşe Nur; Djumabaev, Bekjan; Kucuk, Dilek (2017-12-14)
Sentiment analysis of Twitter data is a well studied area, however, there is a need for exploring the effectiveness of real-time approaches on small data sets that only include popular and targeted tweets. In this paper, we have employed several sentiment analysis techniques by using dynamic dictionaries and models, and performed some experiments on limited but relevant datasets to understand the popularity of some terms and the opinion of users about them. The results of our experiments are promising.
Collective classification of user emotions in twitter
İleri, İbrahim; Karagöz, Pınar; Department of Computer Engineering (2015)
The recent explosion of social networks has generated a big amount of data including user opinions about varied subjects. For classifying the sentiment of user postings, many text-based techniques have been proposed in the literature. As a continuation of sentiment analysis, there are also studies on the emotion analysis. Because of the fact that many different emotions are needed to be dealt with at this point, the problem becomes much more complicated. In this thesis, a different user-centric approach is ...
Citation Formats
K. D. Önal and P. Karagöz, “Named entity recognition from scratch on social media,” presented at the International Workshop on Mining Ubiquitous and Social Environments (MUSE 2015) ECML-PKDD, MUSE Workshop, (07 Eylül 2015), Porto, Portugal, 2015, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/88175.