Show/Hide Menu
Hide/Show Apps
anonymousUser
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Açık Bilim Politikası
Açık Bilim Politikası
Frequently Asked Questions
Frequently Asked Questions
Browse
Browse
By Issue Date
By Issue Date
Authors
Authors
Titles
Titles
Subjects
Subjects
Communities & Collections
Communities & Collections
The effect of data set characteristics on the choice of clustering validity index type
Date
2007-11-09
Author
Taşkaya Temizel, Tuğba
Mizani, Mehrdad A.
Inkaya, Tulin
Yucebas, Sait Can
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
3
views
0
downloads
Clustering techniques are widely used to give insight about the similarities/dissimilarities between data set items. Most algorithms require the user to tune parameters such as number of clusters or threshold for cut-off point in a dendrogram. Such parameters also affect the clustering quality. In a good quality cluster, the intra-cluster similarity should be high, whereas the inter-cluster similarity should be low. To determine the optimal cluster number, several cluster validity methods have been proposed. However, there is no guideline with respect to which clustering validity methods can be used in conjunction with which clustering algorithms. In this paper, Dunn and SD validity indices were applied to Kohonen self organizing maps, k-means and agglomerative clustering algorithms and their limitations were shown empirically.
Subject Keywords
Educational institutions
,
Clustering algorithms
,
Cities and towns
,
Informatics
,
Self organizing feature maps
,
Frequency
,
Partitioning algorithms
,
Employment
,
Cleaning
,
Industrial engineering
URI
https://hdl.handle.net/11511/31546
DOI
https://doi.org/10.1109/iscis.2007.4456856
Collections
Graduate School of Informatics, Conference / Seminar