An Effective approach for comparison of association rule mining algorithms based on controlled data, statistical inference and multiple criteria

Download
2016
Azadiamin, Sanam
Association rules are an important set of data mining results, which are helpful in handling large amount of data and extracting useful association information from them. There are many algorithms developed for finding interesting association rules and also some other algorithms for rule reduction purposes. All of the proposed methods have some strong and weak points, which can be useful according to their application areas. In the literature, there exist several comparison studies trying to find the best algorithm according to the user’s interests. But every comparison approach considers these algorithms using different measures, and it is hard to assess performance of an algorithm with respect to a measure since interesting association rules are unknown. A novel comparison method has been proposed by Jabarnejad (2010) based on interesting rules generated by logistic regression to compare rule reduction algorithms. In this study, this approach is extended to cover all association rule mining algorithms, on a broader set of test data developing and using relevant vi comparison measures. This approach utilizes design and analysis of experiments to generate test data. Furthermore, it defines several comparison measures, and the dependency and importance of these measures are analyzed using statistical methods such as factor analysis, ANOVA and nonparametric hypothesis tests. Finally, if statistical analyses show significant differences between applied association rule mining methods, it handles multiple comparisons using PROMETHEE. The approach is demonstrated by comparing three association rule mining algorithms. The results are discussed and future research directions are presented.

Suggestions

Fuzzy association rule mining from spatio-temporal data
Calargun, Seda Unal; Yazıcı, Adnan (2008-07-03)
The use of fuzzy sets in mining association rules from spatio-temporal databases is useful since fuzzy sets are able to model the uncertainty embedded in the meaning of data. There are several fuzzy association rule mining techniques that can work on spatio-temporal data. Their ability to mine fuzzy association rules has to be compared on a realistic scenario. Besides the performance criteria, other criteria that can express the quality of an association rule discovered shall be specified. In this paper, fu...
Optimization of an online course with web usage mining
Akman, LE; Akkan, B; Baykal, Nazife (2004-02-18)
The huge amount of information existing in the World Wide Web constitutes an ideal environment to implement data mining techniques. Web mining is the mining of web data. There are different applications of web mining: web content mining, web structure mining and web usage mining. In our study we analyzed an online course by web usage mining techniques in order to optimize the navigation paths, the duration of the time spend on each page and the number of visits throughout the semester of the course. Moreove...
Recent Advances in Optimization Models for Data Mining: Clustering, Feature Selection and Classification
Fan, Y-j; İyigün, Cem; Chaovalitwongse , W. A. (American Mathematical Society, 2008-09-01)
Data mining aims at finding interesting, useful or profitable information in very large databases. The enormous increase in the size of available scientific and commercial databases (data avalanche) as well as the continuing and exponential growth in performance of present day computers make data mining a very active field. In many cases, the burgeoning volume of data sets has grown so large that it threatens to overwhelm rather than enlighten scientists. Therefore, traditional methods are revised and strea...
Improving the scalability of ILP-based multi-relational concept discovery system through parallelization
Mutlu, Ayşe Ceyda; Karagöz, Pınar; Kavurucu, Yusuf (2012-03-01)
Due to the increase in the amount of relational data that is being collected and the limitations of propositional problem definition in relational domains, multi-relational data mining has arisen to be able to extract patterns from relational data. In order to cope with intractably large search space and still to be able to generate high-quality patterns. ILP-based multi-relational data mining and concept discovery systems employ several search strategies and pattern limitations. Another direction to cope w...
Discovering fuzzy spatial association rules
Kacar, E; Cicekli, NK (2002-04-04)
Discovering interesting, implicit knowledge and general relationships in geographic information databases is very important to understand and use these spatial data. One of the methods for discovering this implicit knowledge is mining spatial association rules. A spatial association rule is a rule indicating certain association relationships among a set of spatial and possibly non-spatial predicates. In the mining process, data is organized in a hierarchical manner. However, in real-world applications it ma...
Citation Formats
S. Azadiamin, “An Effective approach for comparison of association rule mining algorithms based on controlled data, statistical inference and multiple criteria,” M.S. - Master of Science, Middle East Technical University, 2016.