远洋船舶调度数据挖掘技术研究与应用

英文题名：Study on Oceangoing Ship Scheduling Data Mining Technology and Its Application
作者：朱飞祥
论文级别：博士
学科专业名称：交通信息工程及控制
中文关键词：远洋船舶调度 ; 数据挖掘 ; 船舶调度数据仓库 ; 粗糙集 ; 关联规则 ; 属性约简
英文关键词：Oceangoing Ship Scheduling ; Data Mining ; Ship Scheduling Data Warehouse ; Rough Set ; Association Rule ; Attribute Reduction
学位年度：2008
导师：张英俊
学科代码：082302
学位授予单位：大连海事大学
论文提交日期：2008-09-01

摘要

数据挖掘作为知识发现过程中的重要步骤,是从大型数据库及数据仓库中提取未知的、有价值的和可操作性的关系、模式和趋势用于决策支持的过程。随着船岸通信技术及计算机存储设备的快速发展,在航运企业中出现了海量的船舶调度数据,如何充分利用数据挖掘技术来分析隐含在船舶调度数据内部的规律是海上智能运输研究领域中的一个值得关注的问题。本文主要研究数据挖掘技术在远洋船舶调度相关问题中的应用,结合数据挖掘中关联分析、数据约简、决策规则获取等算法特点,着重探讨了在全球港口货物装卸分析、船舶航线货物分析、船舶营运油耗分析中的应用。为了使数据更高效地进行挖掘分析,对船舶调度数据仓库的结构与应用进行探讨与设计,最后与各种数据挖掘应用形成一个船舶调度数据挖掘体系。主要研究内容和取得的研究成果如下:
     (1)本文通过调研我国船公司的调度业务,建立面向全球港口货物装卸分析、货物流向分析、船舶节能分析等不同主题的船舶调度数据仓库的结构模型,并对其结构、功能及数据存储模型和实现技术进行研究,从而对海量船舶调度数据进行管理与分析,为后续的挖掘算法提供数据支持。随后建立包括数据层、组织层、挖掘层和决策层的船舶调度数据挖掘体系,各层承担着船舶调度数据挖掘不同阶段的任务,从数据预处理、数据挖掘到知识表达,形成了一个完整的体系。
     (2)针对关联规则挖掘过程中需要多次搜索数据表的问题,分析了粗糙集和关联规则的联系,在单维粗糙集关联算法的启发下,提出了一种基于粗糙集等价类的多维关联算法,将多维频繁项集的求取,转换为多属性的等价类的计算,该算法产生的多维频繁项集只包含用户关心的维度,排除了其他维度的干扰,因而在规则获取方面,更能产生满足用户需求的规则。同时,相比Apriori算法减少了数据库扫描次数,因而提高了算法效率,降低了关联规则的挖掘时间。
     (3)研究了多维数据关联规则挖掘算法在船舶航线货物分析中的应用问题。远洋船舶货物运输的实质就是货物在时空上的一个转移过程,考虑到船舶在一个港口可能装载多种货物,然后在不同港口分别卸货的实际情况,将货物维数据从事务数据库转换到信息系统,然后运用本文提出的基于粗糙集等价类的多维关联算法分析船舶航线、船型、货物以及时间维之间的关系,得到了航线船型分布、航线货物流向等船公司感兴趣的规则,也验证了本文提出的算法实用性。
     (4)给出了一种计算正域的改进算法。正域是粗糙集中一个重要的基本概念,依赖度和分类质量的属性约简算法及属性重要度的计算都涉及到正域求解,本文深入分析了正域的定义特点,根据算法中先前的计算结果,及时删除不需要比较的对象,可以大大降低后续计算中物标对的组合数,从而减少计算量,提高计算效率。利用来自UCI(University of California Irvine)的机器学习数据集测试,结果证明该算法相比经典的正域求取算法,效率明显提高,针对大数据集效率提升更为明显。
     (5)众所周知,求所有最小属性的约简是NP问题,本文提出一种以属性多样性为启发条件的基于分类能力的启发式算法,简化了启发式条件,用分类能力计算替换正域计算,相比基于正域的属性约简算法,提高了算法的效率。利用来自UCI的机器学习数据集测试,结果证明该算法相比经典的正域求取算法,效率有明显提高。
     (6)船舶营运油耗是一个受多因素影响的综合性过程,需要对船舶营运中油耗因素展开分析。然而在实际调度报文中,船舶营运油耗的某些属性的属性值存在遗失,是不完备的,因此本文首先将营运油耗数据的属性值完备化,然后利用计算正域改进算法确定船舶营运过程中油耗的主要因素,利用粗糙集属性约简算法对油耗属性进行约简,从而获得有意义的决策规则,为船舶营运过程制定合理节能措施提供理论依据。
     最后,对全文进行了总结,并对有待进一步研究的问题进行了展望。
As an important step in the knowledge discovery, data mining is the process of extracting unknown, valuable and workable relationship, patterns and trends from the large-scale database and data warehouse for decision-making supporting. As the rapid development of the ship-shore communications technology and computer storage devices , ship scheduling data emerges in shipping enterprises, so how to make full use of data mining technology to analyze the implicit rule from the ship scheduling data is one concerned issue of intelligent transportation on sea. Combining with the characteristics of association analysis, data reduction, acquisition of decision-making rules, etc., this dissertation mainly researches on the application of data mining technology on ocean-going vessels scheduling problems, and discusses the application of those methods to the analysis of global port cargo handling, analysis of cargo shipping routes, analysis of fuel consumption for ship sailing. For a more effiective data mining analysis, this dissertation designs and implements the ship scheduling data warehouse, demonstrates its applications. Ship scheduling data warehouse and its applications are integrated into the ship scheduling system. The research contents and results of this dissertation as follows:
     (1) Through the study of China's shipping companies scheduling operations, this dissertation establishes the differenet themes in ship sheduling data warehouse, including theme of global port cargo handling, theme of cargo flows and voyage, theme of energy-saving,etc. The structure, model, function, data storage model and realization of the ship scheduling data warehouse are all studied to manage and analyze the massive ship scheduling data, which offers data support for follow-up data mining algorithms. Subsequently, the ship scheduling data mining system, which includes data layer, organization layer, mining layer and decision layer, is established. Different layer has its own functions, from data pre-process, data mining to knowledge expression, under differenet stages of data mining task to formulate a whole system.
     (2) Aim at resolving the problem of repeatedly accessing the data table for mining association rule, this dissertation analyses the relation between rough set and association rule, then proposes a multi-dimensional association algorithm based on equivalent category in rough set. In this algorithm, the computing of multi-dimensional frequent items is converted to computing of equivalent category with multi-attributes. So, the number and content of multi-dimensional frequent items and association rules produced by this algorithm are limited by interesting dimensions which are assigned by uesr. Compared with Apriori algorithm, this algoritm reduces the number of accessing and scaning database. So this algorithm decreases the time of computing association rules and is efficient.
     (3) This dissertation researches on the application of the multi-dimensional data mining association rules algorithm in the analysis of cargo flow and ship routes. The essence of oceangoing ship transportation is the changes of cargoes position under time and space dimensions. In a voyage, ship may load many kinds of cargo at the same port and discharge those cargoes in different ports. Concernd this, the cargo dimension data is pre-processed and converted to information system. Then, the interesting rules concerned ship type-ship route and cargo category-ship route are obtained by the multi-dimensional data mining association rules algorithm proposed by this dissertation, which is applied to research the relations of ship-route,ship-type,cargo and time.
     (4) Positive region is a key concept in rough set and plays an important role in calculating the dependency degree of attributes, the ability of classification and the significance of attributes. A new improved algorithm of calcualting positive region is proposed by this dissertation. The new algorithm deletes the compared objects timely and cuts down the combinations of object pairs for next computing. Experiments on data sets from UCI show that the new algorithm on attribute reduction is more efficient than classical algorithm of calculating positive region, especially on large data sets.
     (5) It is well known that finding the shortest reduct is NP hard. In this dissertation, a novel heuristic algorithm based on the ability of classification is proposed for attribute reduction. In the new algorithm, cardinality attributes is used as the heuristic. Compared with the positive region calculating algorithm, the new algorithm calculates the ability of classification, instead of generating positive region. Experiments on data sets from UCI show that the new algorithm is more efficient on attribute reduction in decision information system.
     (6) The process of fuel consumption for ship sailing is complicated and easily influenced by many factors. In fact, some attributes of fuel consumption miss values. In this dissertation, incomplete fuel consumption information system is firstly transformed into complete information system. In order to get the valuable decision rules and support the decision-making on energy saving, the new improved alogorithm of calculating positive region is used to computing the significance of the differnet fuel consumption factors and attribute reduction algorithm is used to compute the redcut of fuel consumption factors.
     Finally, the conclusion is made, and the problems for further study are reviewed.

引文

[1]Jiawei Han. Data Mining:Concepts and Techniques, 2001 by MogranKaumfann Publishers. Inc.

    [2]Agrawal R, Srikant R. Fast Algorithm for Mining Association Rules in Large Database. In Research Report RJ9839, IBM Almaden Research Center, San Joes, CA. 1994, 6.

    [3]AGRAWAL R, IMICLINSKI T, SWAMI A. Mining association rules between sets of items in large databases[A]. In: Proc. of ACM SIGMOD International Conference on Management of Data,Washington, DC, 1993,5:207-216.

    [4]AGRAWAL R, IMICLINSKI T, SWAMI A. Database mining:a performance perspective[J]. IEEE Trans Knowledge and Data Enginnering, 1993, 5:914-925.

    [5]Salim, F. D. ; Seng Wai Loke; Rakotonirainy, A.; Srinivasan, B. ; Krishnaswamy, S. ;Collision Pattern Modeling and Real-Time Collision Detection at Road Intersections。Intelligent Transportation Systems Conference, Sept. 30 2007-Oct. 3 2007 Page(s):161 -166.

    [6]Marukatat, R. ; Structure-Based Rule Selection Framework for Association Rule Mining of Traffic Accident Data, Computational Intelligence and Security, Volume 1, Nov. 2006 page(s):781 - 784.

    [7]Zaboli, Sh. ; Naderi, S. ; Moghaddam, A. M. E. ; Application of Image Mining for Knowledge Discovery of Analyzed Traffic Images. Industrial Technology, 2006. 15-17 Dec. 2006 Page(s):1066 - 1070.

    [8]Hauser, T. A. ; Scherer, W. T. ; Data mining tools for real-time traffic signal decision support & maintenance, Systems, Man, and Cybernetics,Volume 3, 7-10 Oct. 2001 Page(s):1471- 1477.

    [9]Wakefield, N. H. ; Bryant, K. P. J. ; Knight, P. R. ; Azzam, H. ; FUMS/spl trade/ artificial intelligence technologies including fuzzy logic for automatic decision making, Fuzzy Information Processing Society, 2005, 26-28 June 2005 Page(s):25 - 30.
    [10]Letourneau, S. ; Famili, F. ; Matwin, S. ; Data mining to predict aircraft component replacement, Intelligent Systems and Their Applications, Volume 14, Issue 6, Nov.-Dec. 1999 Page(s):59 - 66.

    [11]Brotherton, T. ; Jahns, G. ; Jacobs, J. ; Wroblewski, D. ; Prognosis of faults in gas turbine engines, Aerospace Conference Proceedings, Volume 6, 18-25 March 2000 Page (s): 163-171.
    [12]Rehm,F.;Klawonn,F.;Russ,G.;Kruse,R.;Modern Data Visualization for Air Traffic Management,North American Fuzzy Information Processing Society,24-27 June 2007 Page(s):19- 24.
    [13]Riveiro,M.;Falkman,G.;Ziemke,T.;VisuaI Analytics for the Detection of Anomalous Maritime Behavior,Information Visualisation,2008.Ⅳ '08.,9-11 July 2008 Page(s):273- 279.
    [14]]ohansson,F.;Falkman,G.;Detection of vessel anomalies - a Bayesian network approach,Intelligent Sensors,Sensor Networks and Information,2007.3-6 Dec.2007 Page(s):395 -400.
    [15]王亚琴.道路交通流数据挖掘研究(博士论文).上海复旦大学,2007.
    [16]钟珞等.基于数据挖掘技术的城市隧道交通流分析[J].计算机与数字工程,36(5),2008.196-198.
    [17]王云,苏勇.关联规则挖掘在道路交通事故分析中的应用[J].科学技术与工程,8(7),2008:1824-1827.
    [18]蔡志理.高速公路交通事件检测及交通疏导技术研究(博士论文).吉林大学,2007.[19]谈晓洁.基于知识的交通拥堵疏导决策方法及系统研究(博士论文).东南大学,2005.
    [20]刘兴景,杨东援.公路管理数据仓库及其数据分析技术[J].长安大学学报,23(2),2003.67-72.
    [21]钟足峰,刘伟铭,叶长征.高速公路数据挖掘预处理的研究[J].控制管理。23(3),2007,195-196.
    [22]贾利民等.中国铁路智能运输系统的通用技术平台[J].中国铁路,2006,34-37.
    [23]韩春华.易思蓉,吕希奎.数据挖掘技术在铁路选线中的应用[J].中国铁路,2005,48-51.
    [24]黄康.基于粗糙集理论的行车指挥知识获取与决策的应用研究(博士论文).铁道科学研究院,2005.
    [25]张晋春,李广峰.基于数据仓库的站段计划经营决策支持系统的研究[J].铁道运输与经营,24(8),2002,39-41.
    [26]张晖.铁路货运中心辅助决策支持系统研究(硕士轮文).北京交通大学,2006.
    [27]周亚军.基于商务智能的铁路货运营销分析系统的研究实现(硕士论文).西南交通大学,2006.
    [28]吴淑宁.空中交通流量管理的数据仓库与数据挖掘技术研究(硕士论文).清华大学,2004.
    [29]罗萌.崔德光.数据仓库技术在空中交通流量管理系统中的应用[J].计算机工程与应用,19,2002,254-256.
    [30]胡金海等.基于粗糙集理论的航空发动机性能综合评判[J].系统工程与电子技术,28(5).2006,704-707.
    [31]谢庆华,梁剑,左洪福.基于变精度粗糙集的航空发动机送修等级决策[J].系统工程理论方法应用,15(4),2006,380-384.
    [32]牟军敏,周早建,齐传新.数据挖掘技术在内河交通事故分析和预防中的应用[J].中国航海,1,2004,27-29.
    [33]刘正江。吴兆麟.基于船舶碰撞事故调查报告的人的因素数据挖掘[J].中国航海,2,2004,1-6.
    [34]Pawlak.Z.Rough sets[J].International Journal of Information and Computer Science,1982,11(5):314-356.
    [35]韩祯祥,张琦,文福栓.粗糙集理论及其应用综述[J].控制理论与应用,1999,16(2):153
    [36]征峥,束金龙.基于粗糙集与层次分析法的组合预测方法[J].经济数学,2003.20(4):70-76.
    [37]钟波,周家启,肖智.基于粗糙集与神经网络的电力负荷新型预测模型[J].系统工程理论与实践,2004(6):113-119.
    [38]Lixiang Shena,Han Tong Loh.Applying rough sets to market timing decisions[J].Decision Support Systems,2004(37):583-597.
    [39]徐捷,徐从富,耿卫东,潘云鹤.基于粗糙集理论的动态目标识别及跟踪[J].电子学报,2002(4):605-607.
    [40]徐从富.基于粗糙集理论的通信电台及其装载平台识别[J].计算机工程与应用,2002(10):221-225.
    [41]李继良,朱维彰.基于粗糙集理论的物体与相似灯光的辨识[J].杭州电子工业学院学报,2001,21(6):13-18.
    [42]Roman W.Swiniarskia,Andrzej Skowron.Rough set methods in feature selection and recognition[J].Pattern Recognition Letters,2003(24):833-849.
    [43]Pawlak Z.Rough Classification.Inter JMan-Machine Studies,1984,20:469-483.
    [44]刘清,黄兆华,刘少辉,姚力文.带Rough算子的决策规则及数据挖掘中的软计算[J].计算机研究与发展,1999,36(7):800-804.
    [45]林毅,梁家荣.基于粗糙集的规则的挖掘[J].微机发展,2004,14(9):92-94.
    [46]黄沛.李剑.基于粗糙集理论的续保规则挖掘模型[J].上海交通大学学报,2004,38(4):641-646.
    [47]周勇,毛宇光,王建东.中介粗集及其在数据挖掘中的应用[J].南京航空航天大学学报,2000.32(6):609-613.
    [48]张文宇,薛惠锋,张洪才,彭文样.粗糙集在数据挖掘分类规则中的应用研究[J].西北工业大学学报,2002,20(3).
    [49]The ROSETTA homepage,http://www.idi.ntnu.no/~aleks/rosetta
    [50]J.W.Grzymala -- Busse.LERS:a system for learning from examples based on rough sets · In:Slowinski,R.(ed.):Intelligent decision support · Handbook of applications and advances of rough sets theory.Kluwer Academic publishers,Boston,1992:3-18.
    [51]Jerzy W.Grzymala-Busse,Pankaj Shah.A comparision of rule matching methods used in AQ15 and LERS.Z.W.Ras and S.Ohsuga(eds.):ISMIS 2000,LNAI 1932,2000:148-156.
    [52]Anders Torvill Bjorvand.'Rough Enough' -A system supporting the Rough Sets approach.SCAI 1997:290-291.
    [53]张琦.软计算方法在电力系统中的应用研究,博士学位论文.浙江大学,杭州,1999.
    [54]Ziarko A,Szladow.Adaptive process control using rough sets.Proeeedjngs of the international conference of instrument society of America,Chicago,1993,1421-1430
    [55]张文修,吴伟志,梁吉业.粗糙集理论与方法[M].北京:科学出版社,2001.
    [55]I.Duntsch,G.Gedig.Uncertainty measures of rough set prediction[J],Artificial Intelligence,1998,106:109-137.
    [57]Pawlak Z.Rough Sets-Theoretical Aspects of Reasoning about Data.Kluwer Academic Publishers,Dordrecht,1991.
    [58]Iwinski T B.Algebraic approach to rough sets.Bulletin of the Polish Acadernic of Sciences:Mathematics,1987,35(9-10):673-683.
    [59]Dai Jianhua,Chen Weidong,Pan yunhe.Rough Sets and Brouwer-Zadeh Lattices.Proceedings of 2005 International Conference on Rough Sets andKnowledge Technology(RSKT2006),LNAI 4062.Chongqing,China,Jul.2006,P200-207.
    [60]Kuroki N.Rough ideals in semigroups.Information Seienees,Vol.100,1997:139-16
    [61]Yao Y Y.A comparative study of fuzzy sets and rough sets,Information Sciences,1998,109(1-4):227-242.
    [62]Skowron A and Grzymala-Bausse J W.From rough set theory to evidence theory.In:Yager R R,Fedrizzi M and Kacprzyk Jet al(eds.),Advances in the Dempster-Shafter Theory of Evidence,New York 1994:193-236.
    [63]Yao Y Y and Lingras P J.Interpretations of belief functions in the theory of rough sets,Information Sciences,1998,104(1-2):81-106.
    [64]Szczuka M.Rough Sets and Artificial Neural Networks[R].In;Rough Sets in Knowledge Discovery 2;Applications,Case Studies and Software Systems,Physica-Verlag,Heidelberg,1998;449-470.
    [65]Jelonek J, Krawiec K, Slowinski R. Rough Set Reduction of Attributes and their domains for neural networks. International Journal of Computational Intelligence, 1995,11(2): 339-347.

    [66]Lingras P. Unsupervised rough sets classification using Gas. Journal of Intelligent Information Systems, 2001,16(3):215-228.

    [67]Nguyen S H. Discretization of real value attributes: Boolean reasoning approach[Ph. D Dissertation]. Poland: Warsaw University,1997.

    [68]Nguyen S H and Skowron A. Quantization of real value attributes: Rough set and Boolean reasoning approach. Bulletin of International Rough Set Siciety, 1997,1(1):5-16.

    [69]Nguyen S H and Nguyen H s. Discretization of real value attributes for control problems.In: Proceedings of the Fourth European Congress on Intelligent Techniques and Soft Computing(EUFIT96)1, September 2-5, Aachen, Germany, Verlag Mainz, 1996:188-191.

    [70]Nguyen S H and Nguyen H S. Discretization methods with back-tracking. In: Proceedings of the Fifth European Congress on Intelligent Techniques and Soft Computing, (EUFIT' 97),September 8-11, Aachen, Germany, Verlag Mainz, 1997:201-205

    [71]Nguyen S H and Nguyen H S. Application of discretization methods in control. Proceedings of the Workshop on Robotics, Intelligent Control and Decision Support Systems, February 22-23, Polish-Japanese Institute of Information Technology, Warsaw, 1999:47-52.

    [72]Nguyen S H and Nguyen H S. Discretization Methods in Data Mining. In: Plokowski L,Skowron A(eds): Rough Sets in knowledge Discovery, Physica-Verlag,Heidelberg, 1998:451-482.

    [73]Chlebus B, Nguyen S H. On finding optimal discretization on two attributes. In: L.Polkowski, A. Skowron(eds.)Proc of the first International Conference on Rough Sets and Current Trend in Computing(RSCTC' 98), June 1998, Warsaw, Poland, 1998:537-544.

    [74]Nguyen S H. Discretization Problems for Rough Set Methods. In: Polkowski L, Skowron A(eds. )Proc of the first International Conference on Rough Sets and Current Trend in Computing(RSCTC 98), Warsaw, Poland, 1998:545-552.

    [75]Jia Ping, Dai Jianhua, Chen Weidong, Pan Yunhe, Zhu Miaoliang. Immune Algorithm for Discretization of Decision Systems in Rough Sets theory .Journal of Zhejiang University SCIENCEA, 2006, 7(4):602-606.

    [76]Hu X, Cercone N. Learning in Relational Database: A Rough Set Approach. International Journal of Computational Intelligence, 1995,11(2):323-338.
    [77]王珏,王任,苗夺谦等.基于Rough Set理论的”数据浓缩”.计算机学报,1998,21(5):393-400.
    [78]Miao Duoqian,Wang Jue.An Information-based Algorithm for Reduction of knowledge.IEEEICIP' 97,1997:1155-1158.
    [79]苗夺谦,胡桂荣.知识约简的一种启发式算法.计算机研究与发展,1999.36(6):681-684
    [80]Wang J,Miao D Q.Analysis on Attribute Reduction strategies of Rough Set.Journal of Computer Science & Technology,1998,13(2):189-193.
    [81]Wroblewski J.Finding minimal reducts using genetic algorithm.ICS Research Report 16/95,Insitute of computer science,Warsaw University of Technology,1995
    [82]Dai J H and Li Y X.A hybrid genetic algorithm for reduct of attributes in decision system based on rough set theory.Wuhan University Journal of Natural Sciences,2002,7(3):285-289.
    [83]Slezak D.Approximate reducts in decision tables.In:Proc.Of the sixth International Conference,Information Processing and Management of Uncertainty in knowledge-based Systems(IPMU' 96),Julyl-5,Granada,Spain,1996:1159-1164.
    [84]Slezak D.Searching for Dynamic Reducts in Inconsistent Decision Tables.In:Proceedings of the Seventh International Conference on Information Processing and Management of Uncertainty in knowledge-based Systems(IPMU' 98),Paris,France,July6-10,1998:1362-1369.
    [85]Slezak D.Various approaches reasoning with frequency-based decision reducers:a survey.In:Polowshi L,Lin T Y and Tsumoto S(eds.)Rough sets in Soft Computing and Knowledge Discovery:New Developments,Physica-Verlag,Heidelberg,2000.
    [86]Nguyen S H,Skowron A.Boolean reasoning for feature extraction problems.In:Z.W.Ras,A.Skowron(eds.),Tenth International Symposium on Methodologies for Intelligent Syetems,Foundations of Intelligent Systems(ISMIS' 97).October 15-18,Charlotte,NC,USA,Lecture Notes in Artificial Intelligence 1325,Springer-Verlag,Berlin,1997:117-126.
    [87]Shao J.Finding Reducts with User Specified Criteria.In:Wagner R(eds.),Proc.of 8~(th)International Workshop on Database and Expert Systems Applications(DEXA' 97),September 1-2,Toulouse,France,IEEE Computer Society Press,1997:352-357.
    [88]Skowron A and Stepaniuk J.Information Reduction Based on Constructive Neighborhood Systems.In:Proceedings of the Fifth International Workshop on Rough Sets Soft Computing (RSSC' 97)at Third Annual Joint Conference on Information Sciences(JCIS' 97).Duke University,Durham,NC,USA,1997:158-160.
    [89]Son H.Nguyen,A.Skowron.Boolean reasoning for feature extraction problems.In:Z.W.Ras,A.Skowron(eds.),Tenth International Symposium on Methodologies for Intelligent Systems,Foundations of Intelligent Systems(ISMIS' 97),October 15-18,Charlotte,NC,USA,Lecture Notes in Artificial Intelligence 1325,Springer-Verlag,Berlin,1997:117-126.
    [90]贾平,代建华,潘云鹤,朱淼良.一种基于互信息增益率的新属性约算法,浙江大学学报(工学版),2006,40(6):1041-1044.
    [91]Bazan J,Skowron A and Synak P.Discovery of decision rules from experimental data.Research Report40/94,Institute of Mathematics,Warsaw University,1994.
    [92]Bazan J G,Skowron A,Synak P.Dynamic reducts as a tool for extracting laws from decision tables.Proc.of the Symposium on Methodologies for Intelligent Systems,Charlotte,NC,October 16-19,LNAI 869,Springer-Verlag,Berlin,1994:346-355.
    [93]Bazan J G,Dynamic reducts and statistical inference.In:Proceedings of the Sixth International Conference,Information Processing and Management of Uncertainty in Knowledge-Based Systems(IPMIU' 96)vol.Ⅲ,July 1-5,Granada,Spain,vol,Ⅲ,1996:1147-1152.
    [94]Bazan J G.A Comparison of Dynamic and non-Dynamic Rough Set Methods for Extracting Laws from Decision Table.In:L.Polkowski,A,Skowron(eds.),Rough Sets in Knowledge Discovery,Physica - Verlag,Heidelberg,1998:321-365.
    [95]Skowron A and Polkowski L.Synthesis of decision systems from data tables.In:T.Y.Lin,N.CErone(eds.),Rough Sets in Data Mining and Knowledge Discovery vol.l,Physica-Verlag,1998:500-529.
    [96]Stefanowski J.On rough set based approaches to induction of decision rules.In Polkowski L,Skowron A(eds.)Rough Set in Data Mining and Knowledge Discovery vol.1,Physica-Verlag,1998:500-529.
    [97]Stefanowski J.Rough set base rule induction techniques for classification problems,In porc.6~(th) European Congress on Intelligent Techniques and Soft Computing vol,1,Aachen,1998:109-113.
    [98]Nguyen H S.Discovery of generalized patterns.Proceedings of the Eleventh International Svmaosium on Methodologies for Intelligent Systems,Foundations of Intelligent Systems(ISMIS' 99),June 8-11,Warsaw,Lecture Notes in Artificial Intellligence 1609,Springer-Verlag,Berlin,1999:574-582.
    [99]Polkowski L and Skowron A,Decision algorithms:A survey of rough set theoretic methods.Fundamenta Informaticae,1997,30(3-4):345-358.
    [100]Bazan J G,Nguyen S H,Nguyen T T,Skowron and Stepaniuk J.Decision rules synthesis for object classification.In:E.Orowska(ed.),Incomplete Information:Rough Set Analysis,Physica-Verlag,Heidelberg,1998:23-57.
    [101]Nguyen H S,Nguyen T T,Polkowski L,Skowron A,Synak P and Wroblewski J.Decision Rules for Large Data Tables.In:P.Borne,G.Dauphin-Tanguy.C.Sueur and S.EI Khattabi(eds.),CESA' 96:Proceedings of CESA' 96 IMACS Multiconference:Computational Engineering in Systems APPlications 3/4,July 9-12,Lille,France,GERfEC Lille-Cite Scientifique,1996:942-947.
    [102]Nguyen H S,Nguyen T T,Skowron A and Synak P.Knowledge discovery by rough set methods.In:Nagib C.Callaos(eds),ISAS-96:Proc.of the Internationai Conference on Information Systems Analysis and Synthesis,July 22-26,Orlando,USA,1996:26-33.
    [103]Polkowski L,Skowron A,Synak P and Wroblewski J.Searching for approximate description of decision classes.In:S.Tsumoto,S.Kobayashi,T.Yokomori,H.Tannka and A.Nakamura (eds),The Fourth Inernational Workshop on Rough Sets,Fuzzy Sets,and Machine Discovery (RSFD' 96),University of Tokyo,November 6-8,1996:153-161.
    [104]Mollestad T and Skowron A.A rough set framework for data mining of propositional default rules.In:Z.W.Rasand M.Michalewicz(eds.)ISMI-96:Ninth International Symposiumj on Methodologies for Intelligent Systems,Zakopane,Poland,June 10-13,Lecture Notes in Artificial Intelligence I079,Springer-Verlag,Berlin,1996:448-457.
    [105]代建华,潘云鹤.一种基于分类一致性的决策规则获取算法.控制与决策,2004.19(10):1086-1091.
    [106]Golan R,Ziarko W.Methodology for stock market analysis rough set theory.Proc.of IEEE/IAFE Conference on Computational Intelligence for Financial Engineering,New Jersey,1995:32-40.
    [107]Tsumoto S and Tanaka H.PRIMEROSE:Probabilistic rule induction method based on rough sets and resampling methods.Computational Intelligence,1995,11(2):389-405.
    [108]Tsumoto S,Tanaka H.Induction of Disease Description based on Rough Sets.The 1~(st) online workshop on soft computing,Aug,1996.
    [109]Tsumoto S.Automated extraction of medical expert system rules from clinical databases based on rough set theory.Information Sciences,1998,112(1-4):67-84.
    [110]Tsumoto S. Modelling medical diagnostic rules based on rough sets. In Polkowski L and Skowron A (eds.), Proc. of the First International Conference on rough sets and current trends in computing(RSCTC 98), LNAI 1424, Warsaw, Poland, 1998:475-482.

    [111]Stepaniuk J. Rough Set Data Mining of Diabetes Mellitus Data. In: Ras Z W, Skowron A(eds.)Foundations of Intelligent Systems, 11~(th) International Symposium, ISMIS' 99, Warsaw,Poland, Proceedings Springer Lecture Notes in Computer Science 1609,1999.

    [112]Pawlak Z. An inquiry into anatomy of conflicts, Information Sciences, 1998,109(1-4):65-78.

    [113] 郝丽娜,徐心和.粗糙集神经网络系统在故障诊断中的应用[J].控制理论与应用, 2001,18(5):1855—1858.

    [114]Plonka L and Mrozek A. rule-based stabilization of the inverted pendulum.Computational Intelligence, 1995, 11(2):348-356.

    [115]Czogala E et al. Ideas of a rough fuzzy controllers and its application of the stabilization of a pendulum-car system. Fuzzy Sets and Systems, 1995,72(1):61-73.

    [116]Hu X H and Cercone N. Learning maximal generalized decision rules via discretization,generalization and rough set feature reduction. In: Proc. of the 9th IEEE International Conference on Tools with Artificial Intelligence. 1997:548-556.

    [117]Hu X. Knowledge Discovery in Database: An Attribute-Oriented Rough Set Approach[Ph. D Dissertation]. Canada: University of Regina, 1995.

    [118]Shan Nling and Wojciech Ziarko. Data-based acquisition and incremental modification of classification rules. Computational Intelligence, 1995, 11(2): 357-370.

    [119]Shan N, Ziarko W, Hamilton H J et al. Using rough sets as tools for knowledge discovery[A]. in Fayyad U M (eds.), First International Conference on knowledge Discovery and Data Mining[C], Menlo Park, Canada, 1995:268-273.

    [120]Chan C. A rough set approach to attribute generalization in data mining. Information Sciences, 1998,107(1-4):169-176.

    [121]William H.Inmon. Building the Data Warehouse[M], Fourth Edition, Wiley Publishing. 2005.

    [122]xiaoqiang. Suvr on data warehouse. Technical Report, Hongkong University,1996.

    [123]R. Kimball. TheData Warehouse Toolkit. JohnWilye&Sons.NewYokr, 1996.

    [124]朱德利. SQL Server 2005 数据挖掘与商业智能完全解决方案.北京:电子工业出版社, 2007.

    [125]Agrawal R, Srikant R. Fast Alogorithm for Mining Association Rules. in Proceeding 1994 International Conference Very large Database. Santiageo, Chile. 1994,9:487-499.

    [126]Park J.Chen M, Yu P. An Effective hash based Algorithm for Mining Association Rules.IEEE Trans, On knowledge and Data Engineering. 1997, 9(5):813-825.

    [127]Sarasere A, Omiecinsky E, Navathe S. An efficient algorithm for mining association rules in large databases. In 21~(st) Int. Conf. on very Large Database(VLDB), Zurich,Switzerland, pp. 105-112,1995.

    [128]TOIVONEN H. Sampling Large Database for Association Rules[C]. In Proceedings of the 22nd International Conference on Very Large Data Bases, 1996,9:134-145.

    [129]Brin S, et al. Dynamic Itemset Counting and Implication Rules for Market Analysis. In SIGMOD' 97, pp. 255-264,1997.

    [130]Agarwal R, Aggarwal C, Prasasd V V V. A tree projection algorithm for generation of frequent itemsets. Journal of Parallel and Distributed Computing (Special issue on high Performance Data Mining), 61(3),2001:350-371.

    [13l]Han J, Pei J, Yin Y. Ining frequent patterns without candidate generation. In SIGMOD' 2000, Dallas, TX, 2000:1-12.

    [132] J. Liu, Y. pan, K. Wang, and J. Han. Mining frequent itemsets by opportunistic projection.In Proceedings of ACM SIGKDD, Edmonton, Alberta, Canada, 2002.

    [133] Y. G. Sucahyo, R. Gopalan CT-RPO: A Bottom-Up Non Recursive Frequent Itemset Mining Algorithm using Compressed FP-Tree Data Structure, In Proceeding of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations, Brighton, UK, 2004.

    [134]Nicolas pasquier, Yves Bastide, Rafik Taouil,et al. Discovering Frequent Closed Itemsets for Association Riles. In proc. of the 7th Int. .Conf. on Database Theory(ICDT' 99), 1999:398-416.

    [135]BastideY, TaouilR, PasquierN, et al. Mining frequen patterns with counting inference · SIGKDD Explorations, 2(2), 2000:66-75.

    [136]Cristofor D,Cristofor L,Simovici D.Galois connection and data mining. Journal of Universal Computer Science, 6(1), pp. 66-75, 2000.

    [137]Pei J, Han J, Mao R. CLOSET:An efficient algorithm for mining frequent closed itemsets. In ACM-SIGMOD Workshop on Data Mining and Knowledge Discovery (DMKD' 00), pp. 11-20,2000.

    [138]Zaki MJ,Hsiao C-J. CHARM:An Efficient Algorithm for Closed Itemset Minging. In 2~(nd) SIAM Int. Conf. on Data Mining, pp. 12-28,2002.

    [139]Wang J, Han J, Pei J. CLOSET+:Searching for the best strategies for mining frequent closed itemsets. In Proc. 2003 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining(KDD' 03), pp. 236-245, 2003.

    [140]Dao-I Lin, Zvi M. Kedem. Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Set. In 6~(th) Intl. Conf. Extending Database Technology, 1998: 105-119.

    [141]Burdick D, Calimlira M, Gehrke J. MAFIA:a maximal frequent itemset algorithm for transactional databases. In Int. Conf. on Data Engineering, pp. 443-452, 2001.

    [142]R. J Bayardo. Efficiently mining long patterns from databases. In Proc. of the ACM SIGMOD Int. Conf. on Management of Data, 1998:85-93.

    [143]R.C. Agarwal, C. C.Aggarwal, V.V.V. Prasad. Depth first generation of long patterns. In Proc.of the ACM SIGKDD Conf, 2000:108-118.

    [144]G Karam, J. Z.Mohammed. Efficiently mining maximal frequent itemsets. In Proc. of the ICDM Conf. 2001:163-170.

    [145] R. J. Miller and Y. Yang. Association rules over interval Data. In SIGMOD' 97, pp. 452-461,Arizona, USA, 1997.

    [146]Gyenesei, A. A Fuzzy Approach for Mining Quantitative Association Rules. Turku Centre for Computer Science, Technical Report No. 336. 2000.

    [147]Yuefeng Li.Wangzhong Yang, et al. Multi-Tier granule mining for representations of multidimensional association rules. In Proc. Of the 6th IEEE International Conference on Data Mining(ICDM' 06), 2006:953-958.

    [148]WanXin Xu, RuJing Wang. A novel algorithm of mining multidimensional association rules. In ICIC 2006, LNCIS 344,2006:771-777.

    [149]R. Srikant, R. Agrawal. Mining generalized association rules. In Proc. of the 21st VLDB Conf. Zurich, Switzerland, 1995:407-419.

    [150] J. Han, Y. Fu. Discovery of Multiple-level association rules from large databases. IEEE Transactions on Knowledge and Data Engineering, 11(5), 1999:420-431.

    [151]C. H. Cai.Ada W. C. Fu, C. H. Cheng and W. W. Kwong. Mining association rules with weighted items. In IEEE Int.Database Engineering and Applications Symposium, 1998:68-77.

    [152]Shu. J, Tsang. E, Yeung. D. S, Shi. D. Mining fuzzy association rules with weighted items. In Proc. of IEEE Int. Conf. on System, Man and Cybernetics, pp. 1906-1911, 2000.

    [153]Wei Wang, Jiong Yang, Philip Yu. WAR: weighted association rules for item intensities.Knowledge and Information Systems, 2004(6), 2004:203-229.

    [154]FengTao, Fionn Murtagh, Mohsen Farid. Weighted association rule mining using weighted support and significance framework.In Proc.Of the ninth ACM SIGKDD Int'l Conf.on Knowledge Discovery and Data Mining,2003:661-666.
    [155]B.Liu,W.Hsu,Y.Ma.Integrating classification and association rule mining.In Proc.of the KDD 1998,New York,1998:80-86.
    [156]P.tsaii,C.Lee,A.Chen.An efficient approach for incremental association rule mining.In Proc.of the 3rd Pacific-Asia Conference on methodologies for Knowledge Discovery and Data Mining,London,UK,1999:74-83.
    [157]F.Thabtah.Rule preference effect in association classification mining.Journal of Information and Knowledge Management,Vol5(1),2006:1-7.
    [158]H.Hu,J.Li.Using association rules to make rule-based classifiers robust.In Proc.of the 6th Australasian Database Conference,Newcastle,Australia,2005:47-54.
    [159]F.Thabtah,P.Cowling,Y.Peng.MMAC:A new multi-calss,multi-label association classification approach.In Proc.of the 4th IEEE Internation Conference on Data Mining,Brighton,UK,2004:217-224.
    [160]马昕.粗糙集在数据挖掘领域中的应用.浙江大学博士学位论文,2003,9
    [161]Pawlak Z,Skowron A.Rough Sets Rudiments,Bulletin of IRSS,1999:67-70.
    [162]Gawrys M.Rough sets library version 2.0,Warsaw university of technology,1994.
    [163]UCI Web.http://archive,ics.uci.edu/ml/datasets,html.
    [164]刘少辉,盛秋戬,吴斌等.Roush集高效算法的研究.计算机学报.2003,26(5):524-529.
    [165]Nguyen,S.H.Nguyen,H.S.Some efficient algorithms for rough set methods.Proc.of IPMU1996,Granada,Spain,1996:1451-1456.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700