Evaluation and optimization of frequent association rule based classification
Deriving useful and interesting rules from a data mining system is an essential and important task. Problems such as the discovery of random and coincidental patterns or patterns with no significant values, and the generation of a large volume of rules from a database commonly occur. Works on sust...
محفوظ في:
المؤلفون الرئيسيون: | , |
---|---|
التنسيق: | مقال |
اللغة: | English |
منشور في: |
Penerbit Universiti Kebangsaan Malaysia
2014
|
الوصول للمادة أونلاين: | http://journalarticle.ukm.my/6804/1/4801-11319-1-PB.pdf http://journalarticle.ukm.my/6804/ http://ejournal.ukm.my/apjitm |
الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
id |
my-ukm.journal.6804 |
---|---|
record_format |
eprints |
spelling |
my-ukm.journal.68042016-12-14T06:42:14Z http://journalarticle.ukm.my/6804/ Evaluation and optimization of frequent association rule based classification Izwan Nizal Mohd Shaharanee, Jastini Jamil, Deriving useful and interesting rules from a data mining system is an essential and important task. Problems such as the discovery of random and coincidental patterns or patterns with no significant values, and the generation of a large volume of rules from a database commonly occur. Works on sustaining the interestingness of rules generated by data mining algorithms are actively and constantly being examined and developed. In this paper, a systematic way to evaluate the association rules discovered from frequent itemset mining algorithms, combining common data mining and statistical interestingness measures, and outline an appropriated sequence of usage is presented. The experiments are performed using a number of real-world datasets that represent diverse characteristics of data/items, and detailed evaluation of rule sets is provided. Empirical results show that with a proper combination of data mining and statistical analysis, the framework is capable of eliminating a large number of non-significant, redundant and contradictive rules while preserving relatively valuable high accuracy and coverage rules when used in the classification problem. Moreover, the results reveal the important characteristics of mining frequent itemsets, and the impact of confidence measure for the classification task. Penerbit Universiti Kebangsaan Malaysia 2014-06 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/6804/1/4801-11319-1-PB.pdf Izwan Nizal Mohd Shaharanee, and Jastini Jamil, (2014) Evaluation and optimization of frequent association rule based classification. Asia-Pacific Journal of Information Technology and Multimedia, 3 (1). pp. 1-13. ISSN 2289-2192 http://ejournal.ukm.my/apjitm |
institution |
Universiti Kebangsaan Malaysia |
building |
Perpustakaan Tun Sri Lanang Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Kebangsaan Malaysia |
content_source |
UKM Journal Article Repository |
url_provider |
http://journalarticle.ukm.my/ |
language |
English |
description |
Deriving useful and interesting rules from a data mining system is an essential and important task. Problems
such as the discovery of random and coincidental patterns or patterns with no significant values, and the
generation of a large volume of rules from a database commonly occur. Works on sustaining the interestingness
of rules generated by data mining algorithms are actively and constantly being examined and developed. In this
paper, a systematic way to evaluate the association rules discovered from frequent itemset mining algorithms,
combining common data mining and statistical interestingness measures, and outline an appropriated sequence of usage is presented. The experiments are performed using a number of real-world datasets that represent diverse characteristics of data/items, and detailed evaluation of rule sets is provided. Empirical results show that with a proper combination of data mining and statistical analysis, the framework is capable of eliminating a large number of non-significant, redundant and contradictive rules while preserving relatively valuable high accuracy and coverage rules when used in the classification problem. Moreover, the results reveal the important characteristics of mining frequent itemsets, and the impact of confidence measure for the classification task. |
format |
Article |
author |
Izwan Nizal Mohd Shaharanee, Jastini Jamil, |
spellingShingle |
Izwan Nizal Mohd Shaharanee, Jastini Jamil, Evaluation and optimization of frequent association rule based classification |
author_facet |
Izwan Nizal Mohd Shaharanee, Jastini Jamil, |
author_sort |
Izwan Nizal Mohd Shaharanee, |
title |
Evaluation and optimization of frequent association rule based classification |
title_short |
Evaluation and optimization of frequent association rule based classification |
title_full |
Evaluation and optimization of frequent association rule based classification |
title_fullStr |
Evaluation and optimization of frequent association rule based classification |
title_full_unstemmed |
Evaluation and optimization of frequent association rule based classification |
title_sort |
evaluation and optimization of frequent association rule based classification |
publisher |
Penerbit Universiti Kebangsaan Malaysia |
publishDate |
2014 |
url |
http://journalarticle.ukm.my/6804/1/4801-11319-1-PB.pdf http://journalarticle.ukm.my/6804/ http://ejournal.ukm.my/apjitm |
_version_ |
1643736901239177216 |
score |
13.154905 |