An analysis of peculiarity oriented interestingness measures on medical data

Download
2008
Aldaş, Cem Nuri
Peculiar data are regarded as patterns which are significantly distinguishable from other records, relatively few in number and they are accepted as to be one of the most striking aspects of the interestingness concept. In clinical domain, peculiar records are probably signals for malignancy or disorder to be intervened immediately. The investigation of the rules and mechanisms which lie behind these records will be a meaningful contribution for improved clinical decision support systems. In order to discover the most interesting records and patterns, many peculiarity oriented interestingness measures, each fulfilling a specific requirement, have been developed. In this thesis well-known peculiarity oriented interestingness measures, Local Outlier Factor (LOF), Cluster Based Local Outlier Factor (CBLOF) and Record Peculiar Factor (RPF) are compared. The insights derived from the theoretical infrastructures of the algorithms were evaluated by using experiments on synthetic and real world medical data. The results are discussed based on the interestingness perspective and some departure points for building a more developed methodology for knowledge discovery in databases are proposed.

Suggestions

A Shrinkage Approach for Modeling Non-Stationary Relational Autocorrelation
Angın, Pelin (2008-12-19)
Recent research has shown that collective classification in relational data often exhibit significant performance gains over conventional approaches that classify instances individually. This is primarily due to the presence of autocorrelation in relational datasets, meaning that the class labels of related entities are correlated and inferences about one instance can be used to improve inferences about linked instances. Statistical relational learning techniques exploit relational autocorrelation by modeli...
A pattern classification approach for boosting with genetic algorithms
Yalabık, Ismet; Yarman Vural, Fatoş Tunay; Üçoluk, Göktürk; Şehitoğlu, Onur Tolga (2007-11-09)
Ensemble learning is a multiple-classifier machine learning approach which produces collections and ensembles statistical classifiers to build up more accurate classifier than the individual classifiers. Bagging, boosting and voting methods are the basic examples of ensemble learning. In this study, a novel boosting technique targeting to solve partial problems of AdaBoost, a well-known boosting algorithm, is proposed. The proposed system finds an elegant way of boosting a bunch of classifiers successively ...
A similarity-based approach for shape classification using Asian skeletons
Erdem, Aykut; Tarı, Zehra Sibel (Elsevier BV, 2010-10-01)
Shape skeletons are commonly used in generic shape recognition as they capture part hierarchy, providing a structural representation of shapes. However, their potential for shape classification has not been investigated much. In this study, we present a similarity-based approach for classifying 2D shapes based on their Asian skeletons (Asian and Tan, 2005; Aslan et al., 2008). The coarse structure of this skeleton representation allows us to represent each shape category in the form of a reduced set of prot...
An approach to the mean shift outlier model by Tikhonov regularization and conic programming
TAYLAN, PAKİZE; Yerlikaya-Oezkurt, Fatma; Weber, Gerhard Wilhelm (IOS Press, 2014-01-01)
In statistical research, regression models based on data play a central role; one of these models is the linear regression model. However, this model may give misleading results when data contain outliers. The outliers in linear regression can be resolved in two stages: by using the Mean Shift Outlier Model (MSOM) and by providing a new solution for this model. First, we construct a Tikhonov regularization problem for the MSOM. Then, we treat this problem using convex optimization techniques, specifically c...
Investigation of Planar Barrier Discharges for Coherent Nonlinear Structures
Uzun Kaymak, İlker Ümit (2016-09-09)
Nonlinear pattern formations are ubiquitous in nature. One of the analogous configurations in laboratory experiments to such nonlinear systems is the current filament formations observed in glow plasmas. These filaments can generate oscillatory fluctuations in glow, which are also observed in voltage and current measurements. Specifically, semiconductor-gas discharges are known to breed these types of current filaments naturally. The plasma discharge is initiated by applying a DC high voltage to electrodes ...
Citation Formats
C. N. Aldaş, “An analysis of peculiarity oriented interestingness measures on medical data,” M.S. - Master of Science, Middle East Technical University, 2008.