基于有向项集图的关联规则挖掘算法研究与应用

英文题名：The Research and Application of Association Rules Mining Algorithms Based on Directed Itemset Graph
作者：温磊
论文级别：博士
学科专业名称：管理科学与工程
中文关键词：数据挖掘 ; 有向项集图 ; 关联规则 ; 频繁项集 ; 最大频繁项集 ; 频繁闭项集 ; 增量更新挖掘
英文关键词：Data Mining ; Directed Itemset Graph ; Association Rule ; Frequent Itemset ; Maximal Frequent Itemset ; Frequent Closed Itemset ; Incremental Update Mining
学位年度：2004
导师：李敏强
学科代码：1201
学位授予单位：天津大学
论文提交日期：2003-12-01

摘要

数据挖掘(Data Mining，简称DM)也叫数据库中的知识发现(Knowledge Discovery in Databases，简称KDD)，是指从大型的数据库中发现潜在的、新颖的、有价值的、可用的、能被用户理解的模式和信息的过程。关联规则挖掘是数据挖掘的一个重要的研究领域，主要是发现数据库中属性之间的关联关系。
    本文在广泛查阅国内外文献的基础上，针对关联规则挖掘算法的若干问题进行了深入地研究和分析，论文取得的主要成果和创新点如下：
    针对目前关联规则挖掘研究缺乏理论基础的问题，将数学中的格论和形式概念分析等理论引入关联规则挖掘研究中，有效地描述了关联规则挖掘的问题空间，并提出了基于形式概念分析理论的关联规则挖掘的一系列定义和性质。
    针对传统的频繁项集挖掘方法中存在的生成大量候选集、多次遍历数据库计算项集支持度等问题，本文以图论为基础提出了基于有向项集图的频繁项集挖掘算法。算法将原始数据库中的信息保存在有向项集图中，将数据库中的频繁项集发现问题转化为有向项集图中的搜索问题并保证了问题解的完整性。
    本文针对数据库中的最大频繁项集挖掘问题进行了分析和研究，本文提出了基于有向项集图的最大频繁项集挖掘算法。算法利用深度优先的搜索方法，通过计算候选项集的频繁扩展集可以有效地约减问题的搜索空间，提高了算法的效率。
    本文针对数据库中的频繁闭项集挖掘问题进行了分析和研究，提出了基于有向项集图的频繁闭项集挖掘算法。算法利用深度优先的搜索方法，利用频繁闭种子集的性质对搜索空间进行剪枝，可以有效地生成所有的频繁闭项集。
    针对现实数据库中数据不断更新的问题，本文研究了在最小支持度不变的情况下新增数据集后如何发现更新后的数据集中的频繁项集问题。提出了基于有向项集图的完全频繁项集增量更新挖掘算法、最大频繁项集增量更新挖掘算法和频繁闭项集增量更新挖掘算法。
    本文提出和设计的算法针对大规模稠密数据集进行了测试，证明了算法的有效性，并对电力生产的相关数据进行了应用尝试。
Data mining which is also referred as knowledge discovery in databases, means a process of finding nontrivial, extraction of implicit, pervious unknown and potential useful information from data in database. Association rules mining as an important field of data mining discover interesting relationships among attributes in those data.
    By reading the literature domestic and abroad, we research some problem of association rules mining algorithms，the main contexts and innovations are showed as follow:
    We discuss the relationship between lattice theory, formal concept analysis and association rules mining and introduc a series of definition and property of association rules mining .
    A new frequent itemset mining algorithms based on Directed Itemset Graph(DISG) is introduced. By storing information of frequent itemset in DISG. The problem of discovering the frequent itemset from database is transformed into the search problem of DISG .
     A new maximal frequent itemset mining algorithms based on DISG is introduced to discover the long frequent pattern. By using depth-first strategy, the algorithms prune the searching space by computing the frequent extension set of itemset and discover all the maximal frequent itemset efficiently.
    A new algorithms of mining frequent closed itemset based on DISG is introduced. By using depth-first strategy the algorithms prune the searching space by judging the property of frequent closed seedset and discover all the frequent close itemset efficiently.
     The mining algorithms of incremental update frequent itemset, incremental update maximal frequent itemset and incremental update frequent closed itemset are designed based on DISG,. These algorithms can efficiently utilize the result mined and discover the updated frequent itemset efficiently.
    The algorithms proposed in this paper is tested by using the large scale dense dataset which all show good performances. We make an application experiment with the dataset of power station and achieve some valuable information.

引文

[1]R. Agrawal, T. Imielinski, A. Swamy.Database Mining: A Performance Perspective. In IEEE Trans. on Knowl. and Data Engg., 1993. 5(6), 914-925.
    [2]Fayyad, U., Piatesky-Shapiro, G. and Smyth, P. from data mining to knowledge discovery: an overview,” In: Advances in knowledge discovery and data mining, AAAI/MIT Press, 1996, 1-34.
    [3]Fayyad U., Piatetsky-Shapiro G., and Smyth P.: Knowledge Discovery and Data Mining: Towards a Unifying Framework? In Proc. 2nd Int. Conf. on Knowledge Discovery and Data
    [4]J. Han and M. Kamber. Data Mining: Concepts and Techniques. MorganKaufmann Publishers, August 2000.
    [5]Chen M. S., Han J., and Yu P. S. Data Mining: An Overview from Database Perspective. IEEE Transactions on Knowledge and Data Engineering. December 1996, 8(6) 866-883.
    [6]H. Mannila, H. Toivonen, and A. I. Verkamo. Efficient algorithms for discovering association rules. Proc. AAAI’94 Workshop Knowledge Discovery in Databases (KDD94), Seattle, WA, 1994, 181-192
    [7]R. Srikant and R. Agrawal. Mining generalized association rules. Proc. 1995 Int. Conf. Very Large Data Bases (VLDB’95), Zurich, Switzerland, September 1995. 407-419,
    [8]Brin S., Motwani R. and Silverstein C. Beyond Market Baskets: Generalizing Association Rules to Correlations. Proceedings of the ACM SIGMOD, 1997. 265-276.
    [9]Klementtinen M., Mannila H., Ronkainen P., Toivonen H., and Verkamo A. I. Finding interesting rules from large sets of discovered association rules. Proceedings of the CIKM 1994.
    [10]Zaki M. J., Parthasarathy S., Ogihara M., Li W., "New Algorithms for Fast Discovery of Association Rules." KDD Conference Proceedings, 1997. 283-286,
    [11]邹晓峰, 陆建江，宋自林．基于模糊分类关联规则的分类系统，计算机研究与发展，2003，40(5) 651-656
    [12]黄金才,赵侠等. 基于高维空间划分的神经网络分类学习模型. 南京大学学报：自然科学版. 2003, 39(2). 194-204
    [13]郑建军,刘炜. 基于粗集的贝叶斯分类器算法. 北京理工大学学报.


    2003, 23(1). 83-86
    [14]王敞，陈增强，孙青林，袁著扯. 基于K中心方法的氨基酸序列聚类分析计算机工程, 2003 29(8) , 42-43
    [15]Zhang T., Ramakrishnan R., Linvy M.:BIRCH: An Efficient Data Clustering Method for Very Large Databases? In Proc. ACM SIGMOD Int. Conf. on Management of Data, 1996, 103-114.
    [16]行小帅, 焦李成. 数据挖掘的聚类方法, 电路与系统学报. 2003, 8(1). 59-67
    [17]汪闽, 周成虎. 一种带线性约束的最小生成树聚类方法. 模式识别与人工智能. 2002, 15(4). 494-497
    [18]Bouguettaya A.: On-Line Clustering? IEEE Transactions on Knowledge and Data Engineering, 1996 8( 2) , 333-339.
    [19]R. Agrawal, K. Lin, H. S. Sawhney, andK. Shim. Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases. Proc. of the 21st Int'l Conference on Very Large Databases, Zurich, Switzerland, September 1995.
    [20]史东辉, 张春阳等. 离群数据的挖掘方法研究. 小型微型计算机系统. 2001, 22(10). 1234-1236
    [21]杨欣斌, 孙京诰等. 基于蚁群聚类算法的离群挖掘方法. 计算机工程与应用. 2003, 39(9). 12-13,37
    [22]Han, J., Fu, Y., Wang, W., Koperski, K. and Zaiane, O. dMQL: a data mining query language for relational databases. Proceedings of the SIGMOD Workshop on Research Issues in Data
    [23]J. Han, Y. Fu, W. Wang, J. Chiang, W. Gong, K. Koperski, D. Li, Y. Lu, A. Rajan, N. Stefanovic, B. Xia and O. R. Zaiane. DBMiner: A system for mining knowledge in large relational databases. In Proc. International Conf. on Data Mining and Knowledge Discovery (KDD-96), Portland, Oregon, August 1996.
    [24]A. Siebes. Data mining systems development. Technical report, CWI,2000. (PKDD 2000).
    [25]杨炳儒, 熊范纶等. 利用标准SQL查询挖掘多值型关联规则及其评价. 计算机研究与发展. 2002, 39(3). 307-312
    [26]蒋良孝, 蔡之华. 空间数据挖掘的回顾与展望, 计算机工程. 2003, 29(6). 9-10,58
    [27]李德仁, 李德毅. 论空间数据挖掘和知识表现的理论与方法. 武汉大学学报：信息科学版. 2002, 27(3). 221-233
    [28]Wijsen, J., and Meersman, R. 1997. On the Complexity of Mining Temporal


    Trends. In Proceedings of SIGMOD'97 Workshop on Research Issues on Data Mining and Knowledge Discovery, SanJose, Claifornia.
    [29]宋擒豹, 沈钧毅. 基于关联规则的Web文档聚类算法.软件学报. 2002, 13(3). 417-423
    [30]Brian Dunkel and Nandit Soparkar. Data organization and access for efficient data mining. In Proceedings of the 15th ICDE Int. Conf. on Data Engineering, IEEE Computer Society.Sydney, Australia, 1999. 522--529,
    [31]Ciaccia P., Patella M., Zezula P.: M-tree: An Efficient Access Method for Similarity Search in Metric Spaces? In Proc. 23rd Int. Conf. on Very Large Data Bases, Athens, Greece, 1997, 426-435.
    [32]Balaji Padmanabhan and Alexander Tuzhilin. A belief-driven method for discovering unexpected patterns. In Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, 1998. 94-100,
    [33]Robert J. Hilderman and Howard J. Hamilton. Knowledge discovery and interestingness measures: Asurvey. Technical ReportCS 99-04, University of Regina, Regina, Saskatchewan, Canada, 1999.
    [34]施建强, 刘晓平. 基于遗传算法的数据挖掘技术的研究. 电脑与信息技术. 2003, 11(1). 9-14,36
    [35]吉根林, 孙志挥等. 应用人机交互的遗传算法挖掘股票投资风险规则. 小型微型计算机系统. 2002, 23(12). 1492-1495
    [36]周庆敏, 李永生. 基于粗集理论的数据挖掘应用. 南京工业大学学报：自然科学版. 2003, 25(2). 44-48
    [37]安海忠,王广祥等. 粗糙集知识发现的研究现状和展望. 计算机测量与控制. 2003, 11(2). 81-83,105
    [38]谭小萍, 柳炳祥. 数据挖掘在客户关系管理中的应用研究, 华东经济管理. 2003, 17(1), 145-147
    [39]张喆, 常桂然, 黄小原. 数据挖掘技术在CRM中的应用. 中国管理科学.2003, 11(1). 53-59
    [40]张捍东, 杨维翰. 数据挖掘技术在管理与决策中的应用. 华中科技大学学报：自然科学版. 2002, 30(6). 73-75
    [41]朱蔚恒, 陈健. 数据挖掘在电子商务中的应用. 计算机工程. 2002, 28(8). 73-74,113
    [42]张静, 田忠和. 基于ⅡS和web日志的关联关系的挖掘, 华中科技大学学报：自然科学版. 2002, 30(8). 37-39
    [43]罗敏, 张焕国, 王丽娜. 基于数据挖掘的网络入侵检测技术：研究综述, 计算机科学, 2003, 30(2), 105-108

    [44]刘康平, 李增智. 网络告警序列中的频繁情景规则挖掘算法, 小型微型计算机系统.2003, 24(5). 891-894
    [45]吴小明, 邱家驹, 张国江, 蔡建颖. 软计算方法和数据挖掘理论在电力系统负荷预测中的应用, 电力系统及其自动化学报. 2003, 15(1). 1-4,94
    [46]张彦霞, 赵永恒.天文学中的数据挖掘和知识发现. 天文学进展. 2002, 20(4). 312-323
    [47]李宝东, 宋瀚涛. 数据挖掘语言研究现状及发展. 计算机工程与应用. 2003, 39(6). 62-64,94
    [48]龚涛, 蔡自兴. 数据挖掘模型的比较研究. 控制工程（沈阳）. 2003, 10(2). 106-109,130
    [49]汪加才, 赵杰煜等. VISMiner:一个交互式可视化数据挖掘原型系统. 计算机工程. 2003, 29(1). 17-19
    [50]袁红春, 熊范纶. 一个适用于地理信息系统的数据挖掘工具－GISMiner. 中国科学技术大学学报. 2002, 32(2). 217-224
    [51]李天瑞,数据库中的关联规则及挖掘算法研究.?[博士论文],西南交通大学. 2001
    [52]R.Wille. restrucuring lattice theory: an approach based on hierarchies of concepts. In: Rival(ed.). ordered sets.Reidel, Dordrecht-boston 1982, 445-470
    [53]Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Databases, Santiago, Chile, (1994). 487-499
    [54]J. Han, J. Pei and Y. Yin. Mining Frequent Patterns without Candidate Generation. Proc. 2000 ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD00), Dallas, TX,USA, May 2000. 1-12,
    [55]Agarwal R. C., Aggarwal C. C., Prasad V. V. V., Crestana V., "A Tree Projection Algorithm for Generation of Large Itemsets For Association Rules." IBM Research Report, RC 21341.
    [56]Ganter, B. and Wille, R. (1999) Formal Concept Analysis: Mathematical Foundations Springer, Berlin.
    [57]G. Mineau and R. Godin. Automatic structuring of knowledge bases by conceptual clustering. IEEE Transactions on Knowledge and Data Engineering, 1995, 7(5), 824-829
    [58]S. Deogun, V. V. Raghavan, and H. Sever, Formal concept analysis and applications,” Tech. TRCSE-98-15, University Of Nebraska at Lincoln, Department of Computer Science, 1998. J.-P.
    [59]R. Godin, R. Missaoui, and H. Alaoui. Incremental concept formation


    algorithms based on galois (concept) lattices. Computational Intelligence, 1995.11(2), 246-267
    [60] R. Taouil, Y. Bastide, N. Pasquier, G. Stumme, and L. Lakhal. Mining bases for association rules based on formal concept analysis. In 16th IEEE Intl. Conf. on Data Engineering, Feb.
    [61] D. Cristofor, L. Cristofor, and D. Simovici. Galois connection and data mining. Journal of Universal Computer Science, 2000.6(1):60-73,
    [62]谢志鹏，刘宗田.. 概念格与关联规则发现. 计算机研究与发.2000　37(12). 1415-1421
    [63] 基于量化概念格的关联规则挖掘王德兴胡学钢等合肥工业大学学报：自然科学版.2002,25(5).-678-682
    [64]N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Discovering frequent closed itemsets for association rules. In Proc. 7th Int. Conf. Database Theory (ICDT'99), pages 398-416, Jerusalem, Israel, January 1999.
    [65] Agrawal R., Imielinski T., Swami A. Mining association rules between sets of items in very large databases. In Proceedings of the ACM SIGMOD Conference on Management of data, washington,USA,may,1993. 207-216
    [66] 陈富赞. 大型数据库中关联规则发现方法的研究?[博士论文]?天津大学. 2000
    [67] Karam Gouda and Mohammed J. Zaki. Efficiently mining maximal frequent itemsets. In 1st IEEE International Conference on Data Mining, November 2001.
    [68] A. Savasere, E. Omiecinski, and S. Navathe. An efficient algorithm for mining association rules in large databases. Proc. 1995 Int. Conf. Very Large Data Bases (VLDB’95), Zurich, Switzerland,(1995),p 432-443
    [69] J. S. Park, M. S. Chen, and P. S. Yu. An efficient hash-based algorithm for mining association rules. Proc. 1995 ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD’95), San Jose, CA, May 1995. 175-186
    [70] Brin S., Motwani R. Ullman J. D. and Tsur S. Dynamic Itemset Counting and implication rules for Market Basket Data. Proceedings of the ACM SIGMOD, 1997. 255-264.
    [71] 铁治欣. 关联规则采掘的研究. [博士论文]. 浙江大学.1999
    [72]R. Srikant and R. Agrawal. Mining quantitative association rules in large relational tables. Proc. 1996 ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD95), Montreal, Canada, June 1996. 1-12,
    [73]苑森淼,程晓青. 数量关联规则发现中的聚类方法研究. 计算机学报. 2000, 23(8). 866-871

    [74]J. Han and Fu. Discovery of multiple-level association rules from large databases. In VLDB-95, Zurich, Switzerland, September 1995.
    [75]T. Fukuda, Y. Morimoto, S. Morishita, and T. Tokuyama. Mining optimized association rules for numeric attributes. InProc. of the 15th ACM SIGACTSIGMOD-SIGART Symp. on Principles of Database Systems (PODS '96), Montreal, Canada, June 1996.
    [76]C.M. Kuok, A. Fu, and M.H. Wong. Fuzzy association rules in large databases with quantitative attributes. In ACM SIGMOD Records, March, 1998.
    [77] 段云峰, 李剑威. 基于数量的关联规则挖掘. 北京邮电大学学报. 2002, 25(4). 56-60
    [78]陈富赞,寇纪淞等. 基于网络的数值关联规则挖掘方法. 系统工程理论与实践. 2002, 22(4). 1-9
    [79]刘君强, 王勋. 多维多层关联规则有效挖掘的新算法. 南京大学学报：自然科学版. 2003, 39(2). 205-210
    [80]秦锋, 杨学兵. 一种基于APRIORI性质的多维关联规则挖掘算法的研究. 安徽工业大学学报. 2003, 20(2). 141-144
    [81]王文清, 乔雪峰. 带有时态约束的多层次关联规则的挖掘. 北京理工大学学报. 2003 ,23(1). 87-90
    [82] 程继华, 施鹏飞. 多层次关联规则的有效挖掘算法. 计算机学报. 1998, 21(11). 1037-1041
    [83]程继华,施鹏飞. 快速多层次关联规则的挖掘.计算机学报. 1998, 21(11). 1037-1041
    [84] 范明,牛常勇等. 一种挖掘多维关联规则的有效算法. 计算机科学. 2001, 28(11). 44-47
    [85]杨学兵, 蔡庆生. 基于数据立方体的维内关联规则挖掘算法. 北京科技大学学报. 2003, 25(1). 83-86
    [86] Toivonen H.,"Sampling Large Databases for Association Rules". Proceedings of the 22nd International Conference on Very Large Databases, Bombay, India, September 1996.
    [87] Lin DI, Kedem ZM. Pincer-Search: A new algorithm for discovering the maximum frequent set. In: Schek HJ, ed. Proceedings of the 6th European Conference on Extending Database Technology. Heidelberg: Springer-Verlag, 1998. 105~119.
    [88] Bayardo R. Efficiently mining long patterns from databases. In: Haas LM, ed. Proceedings of the ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1998. 85-93.

    [89]R. C. Agarwal, C. C. Aggarwal, and V.V.V. Prasad. Depth first generation of long patterns. In Proc. of the 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 108-118,Boston, MA, USA, 2000.
    [90] J. Pei, J. Han, and R. Mao. CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets. In Proceedings of the A CM-SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000.21-30,
    [91] D. W. Cheung, S. D. Lee, and B. Kao. A General Incremental Technique for Maintaining Discovered Association Rules. In Database Systems for Advanced Applications, pages 185-194, 1997.
    [92]朱玉全,孙志挥等. 快速更新频繁项集. 计算机研究与发展. 2003, 40(1).-94-99
    [93]朱玉全孙志挥等基于频繁模式树的关联规则增量式更新算法计算机学报.2003,26(1).-91-96
    [94]R. Srikant and R. Agrawal. Mining sequential patterns: Proc. 5th Int. Conf. Data engineering (ICDE95), 1995. 3-14,
    [95]R. Srikant and R. Agrawal. Mining sequential patterns: Generalizations and performance improvements. Proc. 5th Int. Conf. Extending Database Technology (EDBT96), Avignon, France, March 1996. 3-17,
    [96] J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M. C. Hsu. PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth. Proc. 2001 Int. Conf. Data Engineering (ICDE01), Heidelberg, Germany, April 2001.
    [97] M. J. Zaki. Fastmining of sequential patterns in very large databases. Technical Report 668, Department of Computer Science, Rensselaer Polytechnic Institute, 1997.
    [98] 靳晓明, 陆玉昌, 石纯一. 序列中的一般化局部序列模式发现. 软件学报. 　14(5). 970-976
    [99]陈富赞, 寇纪淞等. 基于邻接网络的序列规则挖掘算法. 系统工程学报. 2002, 17(5). 385-394
    [100]Roberto J. Bayardo and Rakesh Agrawal. Mining the most interesting rules. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999. 145-154
    [101] M. Klemettinen, H. Mannila, P. Ronkainen, H. Toivonen, and A. I. Verkamo Finding interesting rules from large sets of discovered association rules. Proc. 3rd Int. Conf. Information and Knowledge Management (CIKM94), Gaithersburg, MD, November 1994. 401-408,

    [102]Devavrat Shah, Laks V. S. Lakshmanan, Krithi Ramamritham, and S. Sudarshan. Interestingness and pruning of mined patterns. In 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 1999.
    [103]Szymon Jaroszewicz andDanA. Simovici. Ageneral measure of rule interestingness. In Proceedings of the 5th European Conference on Principles and Practice of Knowledge Discovery in Databases, 2001. 253-265,
    [104]娄兰芳, 蒋志方等. 影响关联规则挖掘的有趣性因素的研究. 计算机工程与应用. 2003, 39(6). 190-192
    [105]B. Lent, A. Swami, and J. Widom. Clustering association rules. Proc. 1997 Int. Conf. Data Engineering (ICDE97), Birmingham, England, April 1997. 220-231,
    [106]B. Liu, W. Hsu, and Y. Ma. Mining association rules with multiple minimum supports. Proc. 1999 Int. Conf. Knowledge Discovery and Data Mining (KDD99), San Diego, CA, August 1999. 337-341
    [107]杨炳儒, 陈泓婕.多最小支持度规则的挖掘算法. 计算机工程. 2003, 29(6). 40-41,115
    [108]王振宇, 白石磊等. 多最小支持策略的关联规则挖掘方法. 小型微型计算机系统. 2002, 23(8). 971-973
    [109]R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. Proc. 1996 Int. Conf. Very Large Data Bases (VLDB96), Bombay, India, September 1996. 134-145,
    [110]R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD'97), Newport Beach, California, August 1997. 67-73
    [111]R. J. Bayardo Jr., R. Agrawal, and D. Gunopulos. Constraint-based rule mining in large, dense databases. In Proc. of the 15th Int'l Conf. on Data Engineering, Sydney, Australia, March 1999.
    [112]一种带约束条件的关联规则频繁集挖掘陈晓云计算机工程与应用.2003,39(2).-205-208
    [113]董雁适, 程翼宇等. 基于高频模式树的项约束关联规则发现方法. 浙江大学学报：工学版. 2002, 36(4). 445-450
    [114]崔立新,苑森淼.约束性相联规则发现方法及算法. 计算机学报. 2000, 23(2). 216-220
    [115]卢炎生, 张蕊. 一种交互式可约束的最小关联规则集挖掘算法, 华中科


    技大学学报：自然科学版. 2003, 31(2). 9-10
    [116]颜雪松,蔡之华等. 数据挖掘的并行策略研究. 计算机工程与应用. 2003, 39(3). 187-189
    [117]Agrawal R., Shafer J. C., "Parallel Mining of Association Rules." IEEE Transactions on Knowledge and Data Engineering. 1996.8(6), 962-969,
    [118]Jong Soo Park, Ming-Syan Chen, and Philip S. Yu. Efficient parallel data mining for association rules. In Fourth Int'l Conference on Information and Knowledge Management, Baltimore, Maryland, November 1995.
    [119]M.J. Zaki. Parallel and distributed association mining: A survey. IEEE Concurrency, December 1999. 7(4):14-25
    [120]李航,刘宗田等. 挖掘关联规则的并行算法. 小型微型计算机系统. 2002, 23(10). 1231-1234
    [121] M.J. Zaki, S. Parthasarathy, M. Ogihara, and W. Li. Parallel algorithms for fast discovery of association rules. Data Mining and Knowledge Discovery: An International Journal, December 1997. 1(4), 343-373,
    [122]S. Morishita and A. Nakaya. Parallel branch-and-bound graph search for correlated association rules. In Proceedings of the ACM SIGKDD Workshop on Large-Scale Parallel KDD Systems, volume LNAI 1759, Springer, Berlin, 2000. 127--144.
    [123]程继华施鹏飞模糊关联规则及挖掘算法计算机学报 .1998,21(11).-1037-1041
    [124] 印鉴, 周祥福. 例外关联规则挖掘. 计算机科学. 2003. 30(3) . 40-43
    [125] Aggarwal C. C., and Yu P. S. Online Generation of Association Rules. Proceedings of the Fourteenth International Conference on Data Engineering, ,Orlando, Florida, February 1998. 402-411
    [126]C. Hidber. Online association rule mining. In Proc. of the 1999 ACM SIGMOD Conf. on Management of Data, 1999.
    [127]J. Han, J. Pei and Y. Yin. Mining partial periodicity using frequent pattern trees. Computing Science Technical Report TR-99-10, Simon Fraser University, July 1999.
    128]R. J. Miller and Y. Yang. Association rules over interval data. Proc. 1997 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD97), Tucson, AZ, May 1997. 452-461
    [129]R. Ng, L. V. S. Lakshmanan, J. Han and A. Pang. Exploratory mining and pruning optimizations of constrained association rules. Proc. 1998 ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD’98), Seattle, WA,


    June 1998. 13-24,
    [130]S. Sarawagi, S. Thomas, and R. Agrawal. Integrating association rule mining with relational database systems: Alternatives and implications. Proc. 1998 ACM-SIGMOD Int. Conf. on Management of Data, Seattle, WA, June 1998. 343-354,
    [131]Park J. S., Chen M. S., and Yu P. S. Using a Hash Based Method with Transaction Trimming for Mining Association Rules. IEEE Transactions on Knowledge andData Engineering. September 1997, 9(5), 813-825.
    [132] J. Hipp, A. Myka, R. Wirth, and U. Guntzer. A new algorithm for faster mining of generalized association rules. In Proc. of the 2nd European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD '98), Nantes, France, September 1998.
    [133]R. Motwani, E. Cohen, M. Datar, S. Fujiware, A. Gionis, P. Indyk, J. D. Ullman, and C. Yang. Finding interesting associations without support pruning. InProc. of the 16th Int'l Conf. on Data engineering (ICDE). IEEE, 2000.
    [134] R. T. Ng, L. V.Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained association rules. In Proc. ACM SIGMOD'98, June 1998. 13-24,
    [135] D. W. Cheung, J. Han, V. T. Ng, A. Fu and Y. Fu. A Fast Distributed Algorithm for Mining Association Rules. In Proc. 4th Int. Conf. on Parallel and Distributed Information Systems, Miami Beach, Florida, Dec. 1996.
    [136] M. J. Zaki. Generating non-redundant association rules. In 6th ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, August 2000.
    [137] W. Wang, J. Yang, and P. Yu. Efficient mining of weighted association rules (WAR). IBM Research Report RC 21692(97734), March, 2000.
    [138]J.-L. Lin and M. H. Dunham. Mining association rules: Anti-skew algorithms. In Proceedings of the 14-th Int. Conf. on Data Engineering, Orlando, Florida, USA, 1998. IEEEComputer Society. 486-493,
    [139]H. Toivonen, M. Klemettinen, P. Ronkainen, K. HAtOnen, and H. Mannila. Pruning and grouping discovered association rules.In MLnet Wkshp. on Statistics, Machine Learning, and Discovery in Databases, Apr. 1995.
    [140]B. Liu, W. Hsu, and Y. Ma. Pruning and summarizing the discovered associations. In 5th ACMSIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, Aug. 1999.
    [141]A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In Proc. of The 4th European Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD),


    Lyon, France, September 2000 13-23,
    [142]Frans Coenen, GrahamGoulbourne, and Paul H. Leng. Computing association rules using partial totals. In Proceedings of the 5th European Conference on Principles and Practice of Knowledge Discovery in Databases, 2001. 54-66
    [143]李乃乾，沈钧毅，田絮资. 一种新的普遍化关联规则挖掘算法. 计算机工程. 2003. 29(7) . 4-6
    [144]陆摘，王转,周春光. 基于FP—tree频集模式的FP—Growth算法对关联规则挖掘的影响.吉林大学学报(理学版). 2003. 41(2). 180—185
    [145]黄晓霞, 萧蕴诗,. 数据挖掘集成技术研究. 计算机应用研究. 2003, 20(4). 37-39
    [146]李学明, 刘勇国等. 扩展型关联规则和原关联规则及其若干性质. 计算机研究与发展. 2002, 39(12). 1740-1750
    [147]罗小波, 刘永等. 基于可信度构架的关联规则挖掘算法的研究. 计算机应用研究. 2002, 19(12). 36-39,60
    [148]邹晓峰,陆建江等.语言值关联规则挖掘算法. 系统仿真学报. 2002, 14(9). 1130-1132
    [149]周宇,叶庆卫. 多表关联的关联规则提取SQL实现. 计算机应用研究. 2002, 19(8). 132-133,137
    [150]邓明荣,史烈等. 挖掘泛化序列模式的一种有效方法. 浙江大学学报：理学版. 2002, 29(4). 415-422
    [151]武鹏程,袁兆山. 混合关联规则及其挖掘算法. 小型微型计算机系统. 2003, 24(5). 895-898
    [152]李乃乾,沈钧等. 基于频繁模式树的普遍化关联规则挖掘. 小型微型计算机系统. 2002, 23(12). 1469-1471
    [153]陆建江.加权关联规则挖掘算法的研究.计算机研究与发展. 2002, 39(10). 1281-1286
    [154]李哲，杨兆中，庞炳章. 大型数据库中关联规则的向量法挖掘. 计算机工程. 2003. 29(8) . 97-99
    [155]H.Toivonen. Sampling Large Database for Association Rules. In Proc. of VLDB, Mumbia, India, 1996, 134—145
    [156]M. J. Zaki and M. Ogihara. Theoretical foundations of association rules. In 3rd ACM SIGMODWorkshop on Research Issues in DataMining and Knowledge Discovery, June 1998.
    [157] K.K.Loo, Chi Lap,Ben Kao, David Chung. A lattice-based approach for


    I/O efficient association rule mining. Information Systems. 27(2002), 41-74
    [158]M. J. Zaki and K. Gouda. Fast vertical mining using Diffsets. Technical Report 01-1, Computer Science Dept., Rensselaer Polytechnic Institute, March 2001.
    [159]M. J. Zaki. Scalable algorithms for association mining. IEEE Transactions on Knowledge and Data Engineering, 2000, 12(3), 372-390
    [160]P. Shenoy, J.R. Haritsa, S. Sudarshan, G. Bhalotia, M. Bawa, and D. Shah. Turbo-charging vertical mining of large databases. In ACM SIGMOD Intl. Conf. Management of Data, May 2000.
    [161]J. Hipp, U. Guntzer, and G. Nakhaeizadeh. Mining association rules: Deriving a superior algorithm by analysing today's approaches. In Proc. of the 4th European Conf. on Principles and Practice of Knowledge Discovery, Lyon, France, September 2000.
    [162]J. Hipp, U. Guntzer, and G. Nakhaeizadeh. Algorithms for association rule mining: a general survey and comparison. SIGKDD explorations, June 2000. 2(1), 58-64
    [163]G. I. Webb. Efficient search for association rules. In Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2000), Boston, MA, 2000. 99-107,
    [164]M. J. Zaki, S. Parthasarathy, W. Li, and M. Ogihara. Evaluation of Sampling for Data Mining of Association Rules. In 7th Int. Workshop on Research Issues in Data Engineering (RIDE), Birmingham, UK, 1997. 42-50,
    [165]Hipp, J. andGuntzer, U. and Nakhaeizadeh, G. Algorithms for Association RuleMining : AGeneral Survey and Comparison. SIGKDD Explorations, 2(1):58-64, June 2000.
    [166]E. Omiecinski and A. Savasere. Efficient mining of association rules in large dynamic databases. In Proc. BNCOD'98, 1998. 49-63
    [167]M.J. Zaki, S. Parthasarathy,M. Ogihara, and W. Li. New algorithms for fast discovery of association rules. In 3rd International Conference on Knowledge Discovery and Data Mining, AAAI
    [168]吉根林, 杨明,孙志挥. 快速挖掘全局频繁项目集. 计算机研究与发展. 2003, 40(4). 620-626
    [169]杨明, 孙志挥. 基于前缀广义链表的快速关联规则挖掘算法. 小型微型计算机系统. 2003, 24(5). 899-901
    [170]黄进,尹治本. 关联规则挖掘的Apriori算法的改进. 电子科技大学学报. 2003, 32(1). 76-79

    [171]王多强, 周建红等. 快速关联规则挖掘算法DPD. 华中科技大学学报：自然科学版. 2002, 30(12). 15-17
    [172]李雄飞, 臧雪柏等. 相联规则增量算法研究. 小型微型计算机系统. 2002, 23(11). 1387-1389
    [173]王玮,蔡莲红. 关联规则的高效挖掘算法研究. 小型微型计算机系统. 2002, 23(6). 708-710
    [174]毛国君,刘椿午. 基于项目序列集操作的关联规则挖掘算法. 计算机学报. 2002, 25(4). 417-422
    [175]施润身,赵青. 改进的关联规则采掘算法及其实现. 同济大学学报：自然科学版. 2002, 30(2). 222-225
    [176]黄艳,苑森淼. 一种高效相联规则提取算法. 吉林大学自然科学学报. 1999(2). 36-38
    [177]R. J. Bayardo. Efficiently mining long patterns from databases. In ACM SIGMOD Conf., June 1998. 85-93
    [178] D. Burdick, M. Calimlim, and J. Gehrke. MAFIA: a maximal frequent itemset algorithm for transactional databases. In Intl. Conf. on Data Engineering, Apr. 2001.
    [179]Charu C. Aggarwal, Towards long pattern generation in dense databases, ACM SIGKDD Explorations Newsletter, v.3 n.1, July 2001
    [180]K. Gouda and M.J. Zaki. Efficiently mining maximal frequent itemsets. In ICDM'01.
    [181]路松峰卢正鼎　快速开采最大频繁项目集　软件学报.2001,12(2).-293-297
    [182]宋余庆, 朱玉全, 孙志挥, 陈耿. 基于FP-Tree的最大频繁项目集挖掘及更新算法.软件学报. 14(9). 1586-1592
    [183]Douglas Burdick, Manuel Calimlim, and Johannes Gehrke. MAFIA: A maximal frequent itemset algorithm for transactional databases. In ICDE, pages 443--452, 2001.
    [184]D.-I. Lin and Z. M. Kedem. Pincer-search: A new algorithm for discovering the maximum frequent set. In Intl. Conf. Extending Database Technology, Mar. (1998)
    [185]R. C. Agarwal, C. C. Aggarwal, V. V. V. Prasad. Depth First Generation of Long Patterns. Proceedings of the ACM SIGKDD Conference, 2000.
    [186] N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal: Discovering frequent closed itemsets for association rules. ICDT99.Jerusalem, Israel, January 1999.398-416

    [187]M. J. Zaki and C. Hsiao. Charm: An efficient algorithm for closed association rule mining. In Technical Report 99-10, Computer Science, Rensselaer Polytechnic Institute, 1999.
    [188]Jianyong Wang, Jiawei Han, Jian Pei .CLOSET+: Searching for the Best Strategies for Mining Frequent Closed Itemsets .In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'03)
    [189]Yves Bastide, Nicolas Pasquier, Rafik Taouil, Gerd Stumme, and LotfiLakhal. Mining minimal non-redundant association rules using frequent closed itemsets. In Proceedings of the First International Conference on Computational Logic, 2000. 972-986,
    [190]squier, Y. Bastide, R. Taouil, and L. Lakhal. Efficient Mining of association rules using closed itemset lattices. Information Systems, 1999. 24(1):25-36,
    [191]N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Pruning closed itemset lattices for association rules. Proc. BDA conf., October 1998. 177-196,
    [192] 冯玉才，冯剑琳. 关联规则增量更新.挖掘软件学报.1998, 9(4), 301-306
    [193] Cheung D W, Lee S D, Kao B, A general incremental technique for maintaining discovered association rules. In:Proceedings of databases systems for advanced applications. Melbourne. Austratia. 1997,185-194
    [194]D. W. Cheung, J. Han, V.T.Ng and C.Y._Wong_ Maintenance of Discovered Association Rules in Large Databases_ An Incremental Updating Tech_nique_ In Proceedings of the 12th ICDE_ New Orleans, Louisiana, February 1996
    [195] 朱玉全，孙志挥. 快速更新频繁项集. 计算机研究与发展.2003,40(1).-94-99
    [196]朱玉全，孙志挥等.基于频繁模式树的关联规则增量式更新算法计算机学报. 2003, 26(1). 91-96
    薛锦,陈原斌. 一种实用的关联规则增量式更新算法. 计算机工程与应用. 2003, 39(13). 212-213,217
    邹翔, 张巍. 大型数据库中的高效序列模式增量式更新算法. 南京大学学报：自然科学版. 2003, 39(2). 165-171
    [197]杨明,孙志挥. 一种基于分布式数据库的全局频繁项目集更新算法. 东南大学学报：自然科学版. 2002, 32(6). 879-883
    [198]郑奕莉,徐国定. 基于最近挖掘结果的关联规则更新算法. 计算机工程. 2002,28(9). 159-161

    [199]陈劲松, 施小英. 一种关联规则增量更新算法. 计算机工程. 2002, 28(7). 106-107
    [200]朱玉全,汪晓刚.一种新的关联规则增量式更新算法.计算机工程. 2002, 28(4). 25-27
    [201]陈丽,陈根才.改进的增量式关联规则维护算法, 系统工程理论与实践. 2001, 21(11) 14-19
    [202]朱玉全,孙志挥.一种有效的关联规则增量式更新算法. 计算机工程与应用. 2001, 37(23). 28-29,90
    [203]杨学兵, 高俊波. 可增量更新的关联规则挖掘算法. 小型微型计算机系统. 2000, 21(6). 611-613

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700