Exploiting Maximal Emerging Patterns for Classification
Abstract
Classification is an important data mining problem. Emerging Patterns (EPs) are itemsets whose supports change significantly from one data class to another. Previous studies have shown that classifiers based on EPs are competitive with other state-of-the-art classification systems. In this paper, we propose a new type of Emerging Pattern, called Maximal Emerging Patterns (MaxEPs), which are the longest EPs satisfying certain constraints. MaxEPs can be used to condense the vast amount of information, resulting in a significantly smaller set of high-quality patterns for classification. We also develop a new overlapping (intersection) based mechanism to exploit the properties of MaxEPs. Our new classifier, Classification by Maximal Emerging Patterns (CMaxEP), combines the advantages of the Bayesian approach and EP-based classifiers. Experimental results on 36 benchmark datasets from the UCI machine learning repository demonstrate that our method has better overall classification accuracy than the JEP-classifier, CBA, C5.0 and NB.

Keywords: Emerging Patterns, classification, Bayesian learning, Maximal Emerging Patterns.
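The definition of Emerging Patterns given in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not say how "change significantly" is measured or which constraints MaxEPs satisfy. The sketch assumes the commonly used growth-rate measure (the ratio of an itemset's supports in two classes) and treats "maximal" as "not a proper subset of another pattern in the set". The function names and the toy data are hypothetical.

```python
from typing import FrozenSet, List

Itemset = FrozenSet[str]


def support(itemset: Itemset, transactions: List[Itemset]) -> float:
    """Fraction of transactions that contain every item of `itemset`."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def growth_rate(itemset: Itemset, background: List[Itemset],
                target: List[Itemset]) -> float:
    """Support in the target class divided by support in the background class.

    Assumption: growth rate is the measure of "significant change".
    A pattern absent from the background but present in the target gets
    an infinite growth rate.
    """
    s_bg = support(itemset, background)
    s_tg = support(itemset, target)
    if s_tg == 0.0:
        return 0.0
    if s_bg == 0.0:
        return float("inf")
    return s_tg / s_bg


def maximal(patterns: List[Itemset]) -> List[Itemset]:
    """Keep only patterns that are not a proper subset of another pattern.

    Assumption: this set-inclusion filter stands in for the paper's
    unspecified MaxEP constraints.
    """
    return [p for p in patterns if not any(p < q for q in patterns)]


# Hypothetical toy data: two classes of transactions over items a, b, c.
class_a = [frozenset("ab"), frozenset("abc"), frozenset("ac"), frozenset("b")]
class_b = [frozenset("a"), frozenset("c"), frozenset("bc"), frozenset("ab")]

pattern = frozenset("ab")
rate = growth_rate(pattern, background=class_b, target=class_a)
```

On this toy data the itemset {a, b} has support 0.5 in class A and 0.25 in class B, so its growth rate from B to A is 2.0. Whether that counts as an Emerging Pattern depends on a user-chosen threshold.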
