空间数据挖掘中若干关键技术研究

设为首页

收藏本站

网站地图 | English | 公务邮箱

About the library

Background
History
Leadership
Organization

Readers' Guide

Opening Hours
Collections
Help Via Email

Publications

Electronic Information Resources

空间数据挖掘中若干关键技术研究

详细信息本馆镜像全文| 推荐本文 | | 获取CNKI官网全文

英文题名：Research on Several Key Technologies in Spatial Data Mining
作者：贾俊杰
论文级别：博士
学科专业名称：地球探测与信息技术
中文关键词：空间数据挖掘 ; 聚类 ; 关联规则 ; EM算法 ; 高斯混合模型
英文关键词：spatial data mining ; clustering ; association rules ; Expectation Maximization Algorithm ; Gaussian Mixture Model
学位年度：2009
导师：张勤
学科代码：081802
学位授予单位：长安大学
论文提交日期：2009-01-16
答辩委员会主席：陶本藻

摘要

空间数据挖掘(Spatial Data Mining，SDM)是指从空间数据库中抽取隐含知识、空间关系或非显式存储在数据库中有意义的特征或模式。该技术在理解空间数据、获取空间与非空间数据间内在关系方面具有重要意义。由于近年来空间地理信息系统(Geography Information System，GIS)广泛地应用到各个行业中，积累大量与空间位置相关的空间数据，因此空间数据挖掘研究己成为当前研究的重要课题。
     本论文正是在这种背景下，在系统地讨论空间数据挖掘的基本理论的基础上，对空间数据挖掘的若干关键技术进行重点研究，论文研究成果可归纳如下：
     1．在总结已有研究工作的基础上，研究了位置-属性一体化的实体信息模型，并分析了3种空间距离测度，可以作为空间计算的基础准则；通过对空间权重矩阵进行拓展，介绍了空间实体关联矩阵的概念，并分析建立方法，为空间数据挖掘提供了新的基础工具。
     2．描述了基于模型聚类的混合模型和基本的期望最大化算法(ExpectationMaximization，EM)算法，尽管EM算法具有普遍性，但是它在实际应用中还是常常受到计算效率的限制。EM算法每一步的迭代中需要遍历所有的样本点。如果数据集非常大，则计算强度也会增加。因此，提出了基于随机子样本的节省计算的递增EM(IncreasingEM，IEM)算法，该算法运行在子集而不是完全样本集上，每一次迭代中，只有较少的样本点需要被估计，这使得算法在运行时间上具有可观的改善。通过EM高效的似然判断条件和增量因子，可以对样本子集的容量自动选择。IEM算法提高了计算效率，并且不需要牺牲似然估计的精确度。
     3．由于EM算法不适合空间聚类对空间信息的要求，而邻域EM(Neighborhood EM，NEM)算法虽然结合了空间惩罚项，但是NEM在E-step步需要大量的迭代。为了既能满足空间信息的要求，又能避免过多的计算量，利用IEM的思想，提出了EM与NEM二者相结合的混合递增NEM(Mixed Increasing EM，MNEM)算法，算法首先在随机子样本中进行EM训练，直到似然判断条件下降，根据增量因子进行样本更新，然后样本转向NEM训练一次，如此进行循环递增的交叉训练，使得计算量降低，性能提高。
     4．在包含被相关属性集合所描述的谓词的空间数据库里，进行多概念级空间关联规则挖掘。一个多级关联规则模式是一个频繁谓词集合，在这个集合里，所有构成谓词的项目分别有一个确定的概念层次。本文提出了在空间数据库中挖掘多概念级空间关联规则的新算法，该数据库中存储了经过空间查询和空间计算所获得的空间谓词，并且依据关系表R的空间关系建立母元素表和频繁类匹配表，这使得多概念级空间关联规则挖掘更加方便和有效。
     5．方位信息是图像数据库中最重要的信息类型之一，而9DLT(Nine DirectionLower-Triangular Martix)表达形式是方位表达的基本方法，据此提出了一种在图像数据库中根据空间方位关系挖掘空间关联规则的(9DLT Image Mining，9DIM)算法，在这里每幅图像都被初始化为9DLT字符串，形成类似于事务数据库的图像模式数据库，每个9DLT字符串(图像)代表一个事务。以图像对象之间的关系模式，建立频繁k-1(k＞2)模式库，并由频繁k-1模式库构造频繁k模式树的方式，依次可以发掘到所有对象间的频繁模式。比Apriori算法更加有效。
Spatial data mining of spatial databases is the extraction of implicit knowledge, spatial relations and discovery of interesting characteristics and patterns which are not explicitly represented in spatial databases.The technique can play an important role in understanding spatial data and capturing the intrinsic relationships between spatial and non-spatial data. In recent years Geography Information System(GIS) has been used in many fields.It has become one of the important tasks, which need be studied currently ,because the amount of spatial data obtained from GIS and other sources has been growing tremendously.
     It is under such background that the author effectively studies the corresponding several key technology on the spatial data mining, and systematieally discussed the basic theory of spatial data mining in this thesis. The achievements of this dissertation can be concluded as follows:
     1 .Based on the present research, the author studis the measurement of spatial distance as a basic rule of spatial computation. By extending the method of spatial weighted matrix, the author analyse forth the conception of spatial entity association matrix, and analyse the method of their establishment and offers new basic tools for SDM.
     2.The author describe the mixture model for model-based clustering and the classic form of the Expectation Maximization(EM) algorithm. Despite EM's wide-spread popularity, practical usefulness of EM is often limited by computational inefficiency. EM makes a pass through all of the available data in every iteration. Thus, if the size of the data set is large, every iteration can be computationally intensive. the author introduces the Increasing EM(IEM) algorithm for fast computation based on random sub-sampling.Using only a subset rather than the entire database allows for significant computational improvements since many fewer data points need to be evaluated in every iteration. The author also argue that one can choose the subsets intelligently by appealing to EMs highly-appreciated likelihood-judgement condition and increment factor. IEM algorithm can lead to significant computational improvements without sacrificing accuracy of the results.
     3.EM algorithm is inappropriate spatial clustering to requires conside-ration of spatial information. Although neighborhood EM (NEM) algorithm incorporates a spatial penalty term, it needs more iterations in every E-step. To incorporate spatial information while avoid too much additional computation, the author proposed Mixed Increasing NEM(MNEM) algorithm that combines EM and NEM. In MNEM, the author first train data based on random sub-sampling in EM till the likelihood-judgement condition begins to decrease,and update sub-sampling .Then training is turned to NEM and runs one iteration of algorithm. Because of this cross train of cycle, MNEM algorithm' computational complexity is decreased and capability is advanced.
     4.The multilevel spatial association rules are discovered from a spatial database in which all items of predicates are described by a set of relevant attributes. A multilevel association pattern is a frequent predicate-set in which all items constituting predicates is at a certain concept level, respectively. In this paper, we present a new approach to discover strong multilevel spatial association rules in spatial databases by storing separately the spatial predicates acquired by the execution of spatial query and some efficient spatial algorithms. Then we construct parent element table and frequent class-matched table based on the spatial relations denoted as relation table R.This makes the discovery of multilevel spatial association rules easy and efficient.
     5.Directional information is one of the most important types of information in an image database, and the Nine Direction Lower-Triangular Martix(9DLT) representation is fundamental in this method. Therefore, we propose a novel spatial mining algorithm, called 9DLT Image Mining(9DIM), to mine the spatial association rules from an image database, where every image is represented by the 9DLT representation.Image mode database is similar to Transaction Database because every 9DLT character string express a transaction.According to relation mode among the image object, we construct frequent k-1 (k>2) mode database. By way of construction of frequent k mode tree based frequent k-1 (k>2) mode database, we can mine frequent mode all of object. Since our proposed algorithm prunes most of impossible candidates, it is more efficient than the Apriori algorithm.

引文

[1]J.Han,M.Kamber.Data Mining:Concepts and Techniques[M].Morgan Kaufmann Publishers,Los Altos,CA.2001.
    [2]邸凯昌．空间数据发掘与知识发现[M]．武汉：武汉大学出版社，2000．
    [3]王家耀．空间信息系统原理[M]．北京：科学出版社，2001．
    [4]ESTIVILL-CASTRO V,LEE I.Clustering with obstacles for geographical data mining[J].ISPRS Journal of Photogramme-try and Remote Sensing,2004,59(1):21～34.
    [5]KANEVSKI M,PARKIN R,POZDNUKHOVA,et al.Envi-ronmental data mining and modeling based on machine learningalgorithms and geostatistics[J].Environmental Modelling & Software,2004,19(9):845～855.
    [6]郭仁忠．空间分析[M]．武汉：武汉测绘科技大学出版社，1997．
    [7]J.Han,M.Kamber.Data Mining:Concepts and Techniques,second ed..Morgan Kaufman,2006.
    [8]周海燕．空间数据挖掘的研究[D]．郑州：解放军信息工程大学博士学位论文，2003．
    [9]李德仁．论“GEOMATICS”的中译名[J]．测绘学报，1998，27(2)：98～98．
    [10]张志兵．空间数据挖掘关键技术研究[D]．武汉：华中科技大学博士学位论文，2004．
    [11](美)沙克哈(Shekhar，S．)等著；谢昆青等译．空间数据库[M]．北京：机械工业出版社，2004．
    [12]范明，孟小峰等译．数据挖掘概念与技术[M]．北京：机械工业出版社，2001，8．
    [13]张楠，曲海平，刘念，等．空间数据挖掘的研究进展[J]．微处理机，2007，27(6)：26～29．
    [14]T.Zhang,R.Ramakrishnan,M.Livny.BIRCH:an efficient data clustering method for very large databases[A].ACM SIGMOD Record,June 1996,25(2):103～114.
    [15]S.Guha,R.Rastogi,K.Shim.CURE:An Efficient Clustering Algorithm for Large Database[A].in:Laura M.Haas,Ashutosh Tiwary eds.Proceedings of ACM SIGMOD International Conference on Management of Data.Seattle,USA.June,1998.Atlantic City,NJ,USA:ACM Press,1998:73～84.
    [16]C.Li,G.Biswas.Unsupervised Clustering with Mixed Numeric and Nominal Data-A New Similarity Based Agglomerative System[A].in:Proceedings of the 1~(st) Pacific-Asia Conference on KOD&Data Mining.Singapore.Feb.1997:35～48.
    [17]M.Ester,H.P.Kriegel,J.Sander.A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise.in:Evangelos Simoudis,Jiawei nan,Usama Fayyad eds..Proc,of the 2nd Int.Conf.on Knowledge Discovery and Data Mining.Porland,Oregon,USA.August 2-4,1996.Menlo Park,CA,USA:AAAI/MIT Press,1996:226～231.
    [18]M.Ankerst,M.Breuning,J.Sander.OPTICS:Ordering Points to Identify the Clustering Stucture[A].in:Alex Delis,Christos Faloutsos,Shahram Ghandeharizadeh eds..Proceedings of the ACM SIGMOD International Conference on Management of Data.Philadelphia,Pennsylvania,USA.June 1-3,1999.Atlantic City,NJ,USA:ACM Press,1999:49～60.
    [19]W.Wang,J.Yang,R.Muntz.STING:A Statistical Information Grid Approach to Spatial Data Mining[A].in:Matthias Jarke,Michael J.Carey,Klaus R.Dittrich,et al.eds.Proceedings of the 23rd VLDB Conference.Athens,Greek.August 25-29,1997.San Francisco,California,USA:Morgan Kaufman,1997:186～195.
    [20]G.Sheikholeslami,S.Chatterjee,A.D.Zhang.WaveCluster:A Multi-Resolution Clustering Approach for Very Large Spatial Databases[A].in:Ashish Gupta,Oded Shmueli,Jennifer Widom eds.Proceedings of the 24th VLDB Conference.New York City,NY,USA.August 24-27,1998.San Francisco,California,USA:Morgan Kaufman,1998:428～439.
    [21]C.Ambroise,M.Dang,G.Govaert.Geostatistics for Environmnental Applications.Dordrecht[M].Norwell:Kluwer Academic Publisher,1997:493～504.
    [22]Kohonen T.Se]f Organization and Associative Memory,3rd Edition[M].New York:Springer Verlag,1989:21～62.
    [23]杨春成，张清浦，田向春，等．应用于面状地理实体聚类分析的线段链形状相似性准则[J]．武汉大学学报：信息科学版，2005，30(1)：61～64．
    [24]邹亮，徐建闽．空间数据挖掘在基于GIS的交通诱导系统中的应用[J]．华南理工大学学报，2004，32(S1)：129～132．
    [25]朱庆伟，苏里，谢宏全．基于粗集挖掘法的GIS和RS融合[J]．辽宁工程技术大学学报，2006，25(1)：28～31．
    [26]宋国杰，唐世渭，杨冬青，等．基于最大熵原理的空间特征选择方法[J]．软件学报2003，14(9)：1544～1550．
    [27]K.Koperski,J.Han.Discovery of spatial association rules in geographic information databases[A].in:Proc 4th Int'l Symp.on Large Spatial Databases(SSD'95),Portland,Maine,1995:47～66.
    [28]刘大有，王生生，虞强源，等．基于定性空间推理的多层空间关联规则挖掘算法[J]．计算机研究与发展，2004，41(4)：565～570．
    [29]袁红春，熊范伦．元规则指导下的逐步求精多层空间关联规则挖掘算法[J]．计算机工程，2004，30(8)：34～36．
    [30]D.Y.Li.Knowledge Representation in KDD Based on Linguistic Atoms.In:Proceeedings of the 1st Pacific-Asia Conference on KDD&DM[A],Singapore,1997:23～24.
    [31]毛国君，段立娟，王实，等．数据挖掘原理与算法[M]．北京：清华大学出版社，2005．
    [32]涂建东，陈崇成，樊明辉，等．基于Java3D的空间关联规则可视化原理与实现[J]．高技术通讯，2004，36(6)：98～102．
    [33]陈立潮，张永梅，刘玉树，等．基于栅格的GIS三维空间数据模型[J]．计算机工程，2004，30(8)：4～6．
    [34]肖予钦，张巨，景军，等．基于R树的方向关系查询处理[J]．软件学报，2004，15(1)：103～111．
    [35]寇雅楠，廖志刚，李增智，等．基于AI的主动数据挖掘技术在网络故障管理中的应用[J]．小型微型计算机系统，2003，24(3)：505～508．
    [36]J Han,K Koperski,N Stefanovie.GeoMiner:A System Prototype for Spatial Data Mining[A],Proc 1997 ACM-SIGMOD Int Conf on Management of Data (SIGMOD' 97),Tucson,Arizona,1997.
    [37]袁红春，熊范纶等．一个适用于地理信息系统的数据挖掘工具-GISMiher[J]，中国科学技术大学学报，2002，32(2)：217～224．
    [38]李新运．城市空间数据挖掘方法与应用研究．山东科技大学博士学位论文．2004．
    [39]李德仁，王树良，李德毅等．空间数据挖掘理论与应用．科学出版社．2006．
    [40]Shi W.Z.,Fisher P.F..Spatial Data Quality[M].London:Taylor & Fraancis.2002.
    [41]Hernandez M.A.,Stolfo S.J..Real-world Data is Dirty:Data Cleansing and The Merge/Purge Problem[J].Data Mining and Knowledge Discovery.1998,2:131.
    [42]WangS.L.et al..A try for handling uncertainties in spatial datamining[J].Lecture Notes in Artificial Intelligence.2004,3215:513～520.
    [43]史文中，王树良．GIS中属性不确定性的处理方法及其发展[J]．遥感学报．2002，6(5)：393～ 400.

    [44] 邬伦,刘瑜等.地理信息系统-原理、方法和应用[M].北京:科学出版社,2001.

    [45] Mceliece, R. .Theory of Information and Coding[M]. Addison-Wesley,Reading,MA. 1977.
    [46] Hathaway, R. J.. Another interpretation of the EM algorithm for mixture distributions.Statist[J]. Probab. Lett. 1986(4) :53—56.
    [47] Neal, R., Hinton, G. ,A view of the EM algorithm that justifies incremental, sparse,and other variants[A]. In: Jordan,M. (Ed.), Learning in Graphical Models. Kluwer Academic Publishers, Dordrecht, pp. 355—368.
    [48] A. Dempster, N. Laird, and D. Rubin, Maximum likelihood Estimation from incomplete data via the EM algorithm[J]. J. Royal Statistic Soc., 1977,30:1-38.
    [49] J. Palarea-Albaladejo, J. A. Martin-Fernandez., A modified EM air-algorithm for replacing rounded zeros in compositional data sets[J]. Computers & Geosciences ,2008(34): 902-917.
    [50] Laurent Bordes, Didier Chauveau, Pierre Vandekerkhove. A stochastic EM algorithm for a semiparametric mixture model[J]. Computational Statistics & Data Analysis, 2007(51):5429-5443.
    [51] Bordes, L., Mottelet, S., Vandekerkhove, P.. Semiparametric estimation of a two-component mixture model[J]. Ann. Statist, 2005(34):1204—1232.
    [52] Cruz-Medina, I. R., Hettmansperger, T. P.,. Nonparametric estimation in semi-parametric univariate mixture models[J]. J. Statist. Comput. Simul,2004(74):513—524.
    [53] Heckerman D. Learaning Bayesian net works:The Combination of Knowledge and Statistical Data[J]. Mcahine Learning, 1995, 20:197—243.
    [54] Brian D. Ripley. Pattern Recognition and Neural Networks[M]. Cambridge University Press. 1996.
    [55] P. Cheeseman, J. Stutz. Bayesian Classification(AutoClass) :Theory and Results, Advances in Knowledge Discovery and Data Mining[M]. AAAI/MIT Press., 1996:153—180.
    [56] Rabine, L. R.. A tutorial on hidden Markov models and selected application in speech recognition[J]. Proceeding of IEEE, 1989,77(2):257-285.
    [57] Wolfgang Jank. Ascent EM for fast and global solutions to finite mixtures: An application to curve-clustering of online auctions[J]. Computational Statistics & Data Analysis. 2006(51):747-761.
    [58] Ipsen, I., Lee, D.. Determinant approximations. Technical Report, Department of Mathematics, North Carolina State University. 2003.
    [59] A. B. Tickle, R.Andrews, M. Golea, et al. The truth will come to lightrdirections and challenges in extracting the knowledge embedded within trained artificial neural networks[A]. IEEE Transactions on Neural Networks, 1998, 9(6):1057—1067
    [60] Towell, G.. Geoffrey,Shavlik. Extracting Refined Rules from Knowledge-Based Neural Networks[J].Machine Learning, 1993, 13(1):71-101
    [61] R. T. Ng, Y. Yu. Discovering strong, common and discriminating characteristics of clusters from thematic maps[A], in Proceedings of the eleventh Annual Symp on Geographic Information Systems. Vancouver, BC. February 17-20,1997. Washington, DC:GIS World, 1997.392-394.
    [62] Ng, R., Han, J.. CLARANS: a method for clustering objects for spatial data mining[A]. IEEE Trans. Knowledge Data Eng. 2002, 14(5), 1003—1016.
    [63] Ester,M. ,Kriegel,H. P. .Sander, J. ,Xu, X. .A density-based algorithm for discovering clusters in large spatial databases with noise[A].In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. 1996, 226—231.
    [64] Karypis, G., Han, E.H., Kumar, V.. CHAMELEON: a hierarchical clustering algorithm using dynamic modeling[J]. Computer 1999,32(8): 68—75.
    [65] Estivill-Castro, V., Lee, I.. Fast spatial clustering with different metrics and in the presence of obstacles[A]. In:Proceedings of the Ninth ACM International Symposium on Advances in Geographic Information Systems. 2001:142—147.
    [66] Tung, A. K. H. ,Hou, J., Han,J..Spatial clustering in the presence of obstacles[A]. In:Proceedings of the 17th International Conference on Data Engineering, 2001:359-367.
    [67] Jain, A. K., Farrokhnia, F.. Unsupervised texture segmentation using gabor filters[J].Pattern Recognition, 1991, 24(12): 1167—1186.
    [68] Guo, D., Peuquet, D., Gahegan, M.. Opening the black box: interactive hierarchical clustering for multivariate spatial patterns[A]. In: Proceedings of the 10th ACM International Symposium on Advances in Geographic Information Systems. 2002:131 — 136.
    [69] Murphy, P.M., Aha, D.W.. UCI Repository of machine learning databases. Department of Information and Computer Science, University of California at Irvine, http://www.ics. uci. edu/mlearn/MLRepository.html. 1994.
    [70] Legendre, P.. Constrained clustering[A]. In:Legendre, P., Legendre, L., (Eds.),Developments in Numerical Ecology, NATO ASI Series G 14.1987:289—307.
    [71] Neal, R., Hinton, G.,A view of the EM algorithm that justifies incremental, sparse,and other variants[A]. In: Jordan.M. (Ed.), Learning in Graphical Models. Kluwer Academic Publishers, Dordrecht, pp. 355—368.
    [72] Neukirchen, C., Rottland, J.,Willett,D.,Rigoll,G..A continuous density interpretation of discrete HMM systems and MMI-neural networks[J]. IEEE Trans. Speech Audio Process.2001, 9(4): 367-377.
    [73] Ambroise, C., Govaert, G.. Convergence of an EM-type algorithm for spatial clustering[J].Pattern Recognition Lett. 1998, 19(10):919—927.
    [74] Rasson, J. P., Granville, V.. Multivariate discriminant analysis and maximum penalized likelihood density estimation[J]. J. Roy.Statist.Soc. 1995, B(57):501—517.
    [75] Geman, S., Geman, D.. Stochastic relaxation, gibbs distributions and the bayesian restoration of images[J]. IEEE Trans. Pattern Anal. Mach. Intell. 1984(6):721 —741.
    [76] G. Chen, Q.Wei, Fuzzy association rules and the extended mining algorithms[J].Information Sciences, 2002,147:201—228.
    [77] P. Y. Hsu, Y. L. Chen, C. C. Ling, Algorithms for mining association rules in bag databases [J]. Information Sciences, 2004,166:31—47.
    [78] A. J. T. Lee, W. C. Lin, C.S.Wang. Mining association rules with multi-dimensional constraints[J]. The Journal of Systems and Software, 2006, 79(1) : 79—92.

    [79] Kullback, S., Leibler, R. A.. On information and sufficiency. Ann. Math. Statist, 1951(22):79-86.
    [80] Quinlan, J. R. . C. 4. 5:Programs for Machine Learning[M]. Morgan Kaufmann, San Mateo, CA. 1993.

    [81] Cressie, N.A.. Statistics for Spatial Data[M]. revised Edition. Wiley, NewYork. 1993.

    [82] Xu, L.,Jordan, M. I.. On convergence properties of the EM algorithm for Gaussian mixtures[J].Neural Comput.1996(8):129～151.
    [83]Duda,R.,Hart,P..Pattern Classification and Scene Analysis[M].Wiley,NewYork.1973.
    [84]Bo Fan.A hybrid spatial data clustering method for site selection:The data driven approach of GIS mining[J].Expert Systems with Applications.2008:1～14.
    [85]K.L.Kaufman,P.J.Rousseeuw.Finding Groups in Data:An Introduction to Cluster Analysis[M].New York,USA:John Wiley and Sons,1990:30～66.
    [86]LeSage,J.P..MATLAB toolbox for spatial econometrics.http://www.spatialeconometrics.com.1999.
    [87]J.Han,M.Kamber,Data Mining:Concepts and Techniques[M],Higher Education Press,Beijing,China,2000.
    [88]王珏，苗夺谦，周育键．关于Rough Set理论与应用的综述[J]．模式识别与人工智能，1996．9(4)：337～344．
    [89]陈长清．数据仓库与联机分析处理技术研究[D]．武汉：华中科技大学博士学位论文，2003．
    [90]P.M.Murphy,D.W.Aha.UCI Repository of Machine Learning Databases.Department of Information and Computer Science,University of California,Irvine.http://www.ics.uci.edu/~mlearn/MLRepository.html.1998/2001.4
    [91]Y.C.Hu,G.H.Tzeng,C.M.Chen.Deriving two-stage learning sequences from knowledge in fuzzy sequential pattern mining[J].Information Sciences,2004,159:69～86.
    [92]M.Y.Lin,S.Y.Lee,Interactive sequence discovery by incremental mining[J].Information Sciences,2004(165):187～205.
    [93]A.J.T.Lee,Y.T.Wang,Efficient data mining for calling path patterns in GSM networks[J].Information Systems,2003,28(8):929～948.
    [94]K.Lagus,S.Kaski,T.Kohonen,Mining massive document collections by the WEBSOM method[J].Information Sciences,2004(163):135～156.
    [95]P.Perrin,F.E.Petry,Extraction and representation of contextual information for knowledge discovery in texts[J].Information Sciences,2003(151):125～152.
    [96]Y.Li,S.Zhu,X.S.Wang,S.Jajodia,Looking into the seeds of time:Discovering temporal patterns in large transaction sets[J].Information Sciences,2006(176):1003～1031.
    [97]P.Lingras,M.Hogo,M.Snorek,C.West,Temporal analysis of clusters of supermarket customers:conventional vs.interval set approach[J].Information Sciences,2005(172):215～240.
    [98]U.Fayyad,P.Smyth,Image database exploration:progress and challenges[M].in:Proc.of Knowledge Discovery in Database Workshop,AAAI Press,Menlo Park,CA,1993:27～44.
    [99]K.Koperski,J.Han,Discovery of spatial association rules in geographic information databases[M].in:Proceedings of the Fourth International Symposium,Large Spatial Databases (SSD' 95),Portland,ME,August 1995:47～66.
    [100]Lizhen Wang,Kunqing Xie,Tao Chen,Xiuli Ma.Efficient discovery of multilevel spatial association rules using[J].Information and Software Technology.2005,(47):829～840.
    [101]R.Agrawal T.Imielinski,and A.Swami.Mining association rules between sets of items in lagre database[A].Peter Buneman,Sushil Jajodia.Proeeedings of ACM SIGMOD International Conference on Management of Data(SIGMOD' 93)Washington,D.C,ACM Press,1993:207～216.
    [102]Wan-Jui Lee,Jung-Yi Jiang,Shie-Jue Lee.Mining fuzzy periodic association rules[J].Data & Knowledge Engineering,2008(65):442～462.
    [103] N. Pasquier, Mining association rules using formal concept analysis[A], in:Proceedings of the Eighth International Conference on Conceptual Structures (ICCS' 2000), Darmstadt, Germany, August 2000:259—264.
    [104] R. Agrawal, R. Srikant, Fast algorithms for mining association rules[A].in:Proceedings of the 1994 International Conference on Very Large Data Base (VLDB'94),Santiage, Chile, September 1994:487—499.
    [105] I.H. Toroslu, Repetition support and mining cyclic patterns[J]. Expert Systems with Applications, 2003, 25(3) :303—311.

    [106] J. Han, Y. Fu, Discovery of multiple-level association rules from large databases [A].in: Proceedings of the 1995 International Conference on Very Large Data Bases (VLDB' 95), Zurich, Switzerland, September 1995:420—431.

    [107] A. Savasere, E. Omiecinski, S. Navathe, An efficient algorithm for mining association rules in large databases[A], in: Proceedings of the 1995 International Conference on Very Large Data Bases (VLDB'95), Zurich, Switzerland, September 1995:432—443.
    [108] H. Lu, J.Han, L. Feng, Stock movement and n-dimensional intertransaction association rules[C]. in: Proceedings of the 1998 SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery(DMKD' 98), Seattle, WA, June 1998,12:1-12.
    [109]O. R. Zayane, J. Han, H. Zhu, Mining recurrent items in multimedia with progressive resolution refinement[C]. in: Proceedings of the 2000 International Conference on Data Engineering (ICDE' 00), San Diego,CA, February 2000:461-470.
    [110]X. Lin, Y. Li, C. P. Tsang, Applying on-line bitmap indexing to reduce counting costs in mining association rules[J]. Information Sciences, 1999(120):197—208.
    [111] C. Y. Wang, S. S. Tseng, T. P. Hong, Flexible online association rule mining based on multidimensional pattern relations[J]. Information Sciences, 2006(176):1752—1780.
    [112] J. X. Yu, Z. Chong, H. Lu, Z. Zhang, A. Zhou, A false negative approach to mining frequent itemsets from high speed transactional data streams[J]. Information Sciences 2006(176): 1986-2015.

    [113]J. S. Park, M.S.Chen, and P. S. Yu. An effective hash-based algorithm for mining association rules[A]. Proceedings of ACM SIGMOD International Conference on Management of Data, May 1995:175—186.

    [114]R. Agrawal, R. Srikant. Fast Algorithms for Mining Association Rules[A]. In Proceedings of 20th International Conference Very Large Dat aBases, Morgan Kaufmann 1994:487—499.

    [115]J. Han, and Y. Fu.Discovery of multiple-level association rules in large databases[A].Umeshwar Dayal, Peter M. D. Gray, Shojiro Nishio. Proceedings of the 21st International Conference on Very Large Data Bases, Zurich Switzerland.Morgan Kaufmann, 1995:420—431.

    [116]A. Savasere, E. Omiecinski, and S. Navathe. An efficient algorithm for mining association rules in large database[A]. In Proceedings of the 21st International Conference on Very Large DataBases, 1995:432—444.
    [117] H. Toivonen. Sampling large database for association rules[A]. In Proceedings of the 22nd International Conference on Very Large Data Bases, 1996:134—145.
    [118]S. Brin, R. Motwani, J. D. Ullman, and S.Tsur.Dynameic itemset counting and implication rules for market basket data[A]. In Proceedings of ACM SIGMOD International Conference on Management of Data(SIGMOD' 97), 1997:255-264.
    [119] R. Agrawal, C. Agrawal, and V. V. V. Prasad. A tree projection algorithm for generation of frequent itemsets[J]. In Journal of Parallel and Distributed Computing (Special Issue on High Performance Data Mining), 2000.
    [120]J.Han,J. Pei, and Y.Yin. Mining frequent patterns without candidate generation[A]. In SIGMOD' 2000, Dallas, TX, May 2000.
    [121]Shashi Shekhar, Yan Huang. Discovering Spatial Co-location Patterns:A Summary of Results[A]. In:Proceedings of the Seventh International Symposium on Spatial and Temporal Databases, Redondo Beach, CA, USA. 2001:236—256.
    [122]Yan Huang,Hui Xiong,Shashi Shekhar, et al. Mining Confident Co-location Rules without A Support Threshold[A]. In Proceedings of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 2003:497—501.
    [l23]Yan Huang. Mining Co-Location Patterns from Large Spatial Datasets[D], PhD dissertation of the University of Minnesota, 1999.
    [124] D. Malerba, F. A. Lisi, AnILPmethod for spatial association rulemining[A]. in:Proceedings of the Workshop on Multi-relational Data Mining(MRDM' 2001), Freiburg,Germany, September 2001:18—29.
    [125]D. Malerba, F. Esposito, F. A. Lisi, Mining spatial association rules in census Data,Specifying Collective Outputs from UN/ECE Activities on Statistical Information Technology, pp. 541—550.
    [126] E. Clementini, P.D. Felice, K. Koperski, Mining multiple-level spatial association rules for objects with a broad boundary [J]. Data and Knowledge Engineering, 2000, 34(3) :251-270.
    [127] M. Kamber, J. Han, J. Y. Chiang, Metarule-guided mining of multidimensional association rules using data cubes[A]. in:Proceedings of the 1997 International Conference on Knowledge Discovery and Data Mining (KDD' 97), Newport Beach, CA, August 1997:207—210.
    [128]邓有莲.GIS数据库中带有决策属性集的空间关联规则挖掘技术研究[D].江西:江西师范大学硕士学位论文,2007.
    [129] L.Z.Wang,A method of the abstract generalization on the bases of the semantic proximity[J]. Chinese Journal of Computation, 2000,23(10):1114—1121.
    [130] L.Z. Wang, L. H. Zhou, H. M. Chen, Research on mining outliers' methods on the basis of the concept hierarchy tree[J]. Journal of Computer Engineering and Applications (in Chinese), 2002, 38(11) :70-74.
    [131]W. Lu, J. Han, B. C. Ooi, Discovery of general knowledge in large spatial databases[A]. in:Proc. of Far East Workshop on Geographic Information Systems, Singapore, 1993:275—289.
    [132]Y. Morimoto, Mining frequent neighbor class sets in spatial databases[A]. in:Proc. of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco,CA,2001:353-358.
    [133]R. Ng, J. Han, Efficient and effective clustering method for spatial data mining[A]. in:Proc. of International Conference on Very Large Data Bases, Morgan Kaufman, San Francisco, CA, 1994:144-155.
    [134]B. Pang, D. Zhang, K. Wang, Tongue image analysis for appendicitis diagnosis[J].Information Sciences, 2005, (175): 160—176.
    [135] K.Sambasiva Rao, K-means clustering for categorical attributes[M],M. Tech. Thesis,December 1998, Indian Institute of Technology, New Delhi, India.
    [136] M.J. Zaki, Scalable algorithms for association mining[J]. IEEE Transactions on Knowledge and Data Engineering, 2000, 12(3):372—390.
    [137]S. J. Yen, L. P. Chen, A graph-based approach for discovering various types of association rules[J]. IEEE Transactions on Knowledge and Data Engineering,2001, 13(5):839—845.
    [l38]Daniel A. Keim, Christian Panse, Mike Sips. Pixel based visual data mining of geo-spatial data[J]. Computers & Graphics, 2004, (28):327—344.
    [139] Y. Li, S. Zhu, X. S. Wang, S. Jajodia, Looking into the seeds of time: Discovering temporal patterns in large transaction sets[J]. Information Sciences ,2006, (176):1003-1031.
    [140]E. G. M. Petrakis, C. Faloutsos, Similarity searching in medical image databases[J]. IEEE Transactions on Knowledge and Data Engineering, 1997, 9(3):435—447.
    [141]V. Megalooikonomou, Evaluating the performance of association mining methods in 3D medical image databases[A]. in:Proc. of the 2nd SIAM International Conference on Data Mining, Arlington, VA, 2002:474—494.
    [142]B. Pang,D.Zhang,K.Wang,Tongue image analysis for appendicitis diagnosis[J].Information Sciences, 2005, (175):160—176.
    [143]R. R. Yager, An extension of the naive Bayesian classifier[J]. Information Sciences, 2006,(176):577-588.
    [144]C. C. Chang, S. Y. Lee, Retrieval of symbolic pictures[J]. Journal of Information Science and Engineering, 1991, 7(3): 405—422.
    [145]Y. J. Kim, C. B. Sim, J. W. Chang, Supporting ranking in iconic image databases [A].in:Proc. of the ACM International Conference onInformation and Knowledge Management,Kansas City, 1999:450—457.
    [146]K. Koperski, J. Han, Discovery of spatial association rules in geographic information databases [A]. in: Proc. of International Symposium on Advance in Spatial Databases, SSD,LNCS, vol. 951, Springer Verlag, 1995:47-66.
    [147]R. Ng, J. Han, Efficient and effective clustering method for spatial data mining[A]. in:Proc. of International Conference on Very Large Data Bases, Morgan Kaufman, San Francisco, CA, 1994:144—155.
    [148]M. Ester, H. P. Kriegel, J. Sander, X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise[A]. in:Proc. of the 2nd International Conference on Knowledge Discovery and Data Mining, Portland, Oregon, 1996:226—231.
    [149]E. Knorr, R. Ng, Finding aggregate proximity relationships and commonalities in spatial data mining[J]. IEEE Transactions on Knowledge and Data Engineering 1996,8(6):884—897.
    [150] Y. C. Huang, Mining frequent spatial co-relation patterns[D]. Master Thesis,Department of Computer Science, National Chengchi University, Taiwan, 2004.
    [151]W. Lu,J. Han,B.C. Ooi,Discovery of general knowledge in large spatial databases[A]. in:Proc. of Far East Workshop on Geographic Information Systems, Singapore, 1993:275—289.
    [152]Y. Morimoto, Mining frequent neighbor class sets in spatial databases[A]. in:Proc. of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA,2001:353～358.
    [153]S.Shekhar,Y.Huang,Discovering spatial co-location patterns:a summary of results[A].in:Proc.of the 7th International.Symposium on Spatial and Temporal Databases (SSTD),2001:236～256.
    [154]Anthony J.T.Lee,et,al..Mining spatial association rules in image databases[J].Information Sciences.2007,(177):1593～1608.
    [155]孙庆先，方涛，郭达志．图像数据挖掘中的关联规则[J]．计算机工程．2006，32(5)．
    [156]Kangkachit T,Waiyamai K.A Business-oriented Spatial Association Rule Mining System Prototype(BoSARM)[Z].http://pindex.ku.ac.th/file_research/KDL_BoSARM.pdf/,2004.
    [157]P.W.Huang,C.H.Lee,Image database design based on 9D-SPA representation for spatial relations[J].IEEE Transactions on Knowledge and Data Engineering,2004,16(12):1486～1496.
    [158]Ding Q.Association Rule Mining on Remotely Sensed Imagery Using P-trees[Z].http:// midas.cs.ndsu.nodak.edu/～ding/,2004.
    [159]X.Zhang,N.Mamoulis,D.W.Cheung,Y.Shou,Fast mining of spatial collocations[A].in:Proc.of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,Seattle,Washington,2004:384～393.
    [160]G.Bordogna,S.Chiesa,D.Geneletti,Linguistic modelling of imperfect spatial information as a basis for simplifying spatial analysis[J].Information Sciences,2006 (176):366～389.
    [161]P.W.Huang,C.H.Lee,Image database design based on 9D-SPA representation for spatial relations[J].IEEE Transactions on Knowledge and Data Engineering,2004,16(12):1486～1496.
    [162]马超飞，刘建强．遥感图像多维量化关联规则挖掘[J]．遥感技术与应用，2003，18(4)：243～247．
    [163]Chang S.K.,Shi Q.Y,and Yan C.W.Iconic Indexing by 2-D Strings[J].IEEE Transaction on Pattern Analysis and Machine Intelligence,1987,PAMI-9(3):413～427.
    [164]Chang S.K.,and Li Y.Representation of Multi-Resolution Symbolic and Binary Pictures Using 2DH Strings[A].In:Processings of IEEE Workshop on Language for Automation,1988:190～195.
    [165]Jungert E.Extended symbolic projection used in a knowledge structure for spatial reasoning[J].In:Proceedings of 4th BPRA Conference on Pattern Recognition,1988.Springer-Verlag,Cambridge,Mar.28～30.
    [166]X.Lin,Y.Li,C.P.Tsang,Applying on-line bitmap indexing to reduce counting costs in mining association rules[J].Information Sciences,1999(120):197～208.
    [167]D.Comaniciu,P.Meer,Robust analysis of feature spaces:color image segmentation[J].in:Proc.of IEEE International Conference on Computer Vision and Pattern Recognition,San Juan,Puerto Rico,1997:750～755.
    [168]J.Wang,J.Han,BIDE:efficient mining of closed sequences[A].in:Proc.of the IEEE International Conference on Data Engineering,Boston,Massachusetts,2004:79～90.
    [169]M.J.Zaki,C.J.Hsiao,Efficient algorithms for mining closed itemsets and their lattice structure[J].IEEE Transactions on Knowledge and Data Engineering,2005,17(4):462～478.
    [170]E.Nardelli,G.Proietti,Efficient secondary memory processing of window queries on spatial data[J].Information Sciences,1995(84):67～83.