Abstract or Full-text in Topic Modeling? Konu Modellemede Özet mi Tam Metin mi?

2022-01-01
Tekin, Yasar
Coşar, Ahmet
Topic modeling is a text mining technique used for automatic extraction of topics addressed in document collections. Although there are different topic models proposed by researchers, the most preferred one is Latent Dirichlet Allocation (LDA). Despite such widespread use, uncertainties about LDA have not been fully resolved yet. In this study, the effect of using abstracts or full-text articles on LDA model parameters is investigated. For this purpose, LDA parameters are optimized on abstracts and full-texts of articles published in two different scientific journals and the results obtained are compared with each other.
30th Signal Processing and Communications Applications Conference, SIU 2022

Suggestions

APPLICATION OF TEXT MINING TO TECHNOLOGY MANAGEMENT DOMAIN TO EXTRACT TOPICS AND TRENDS
Tekin, Yaşar; Karagöz, Pınar; Department of Science and Technology Policy Studies (2022-1-17)
Topic modeling is a widely used technique to extract latent topics from large document collections. One of the most remarkable uses of it is its application to scientific fields. If topic modeling is applied to all articles published in a specific scientific field, it provides an overall view of topics and trends for the time period under consideration. If it is applied to a single conference or journal, it reveals differences from global trends. The most popular method used for topic modeling is Latent Dir...
K-median clustering algorithms for time series data
Gökçem, Yiğit; İyigün, Cem; Department of Industrial Engineering (2021-3-10)
Clustering is an unsupervised learning method, that groups the unlabeled data forgathering valuable information. Clustering can be applied on various types of data. Inthis study, we have focused on time series clustering. When the studies about timeseries clustering are reviewed in the literature, for the time series data, the centers ofthe formed clusters are selected from the existing time series samples in the clusters.In this study, we have changed that view and have proposed clustering algorithmsbased...
Utilization of feature modeling in axiomatic design
Üçtepe, Orhan; Doğru, Ali Hikmet; Department of Computer Engineering (2008)
This thesis provides an approach to use feature modeling with a set of guidelines for requirements definition and decomposition activities of the axiomatic design methodology. A tool that supports the development of feature models and modeling of the Axiomatic Design activities is implemented to be utilized for guiding the designer. Axiomatic Design suggested four domains of information in the transformation of the problem definition to the solution, and provided mechanisms for supporting the mapping among ...
Clustering of manifold-modeled data based on tangent space variations
Gökdoğan, Gökhan; Vural, Elif; Department of Electrical and Electronics Engineering (2017)
An important research topic of the recent years has been to understand and analyze data collections for clustering and classification applications. In many data analysis problems, the data sets at hand have an intrinsically low-dimensional structure and admit a manifold model. Most state-of-the-art clustering methods developed for data of non-linear and low-dimensional structure are based on local linearity assumptions. However, clustering algorithms based on locally linear representations can tolerate diff...
A novel and precise false positive probability computation for Bloom Filters implemented with universal hash functions
Koltuk, Furkan; Schmidt, Şenan Ece; Department of Electrical and Electronics Engineering (2022-8-22)
Bloom Filters (BF) are multiple-hashing data structures that are widely used in membership testing applications. The many-to-one nature of the BF hashing results in false positive outcomes which have to be further processed at a performance cost. The computation of the false positive probability of BFs is carried out under the assumption of uniform and independent hash functions. To the best of our knowledge, all previous work in the literature assume that the hash functions are uniform and independent with...
Citation Formats
Y. Tekin and A. Coşar, “Abstract or Full-text in Topic Modeling? Konu Modellemede Özet mi Tam Metin mi?,” presented at the 30th Signal Processing and Communications Applications Conference, SIU 2022, Safranbolu, Türkiye, 2022, Accessed: 00, 2023. [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85138679793&origin=inward.