用户名: 密码: 验证码:
基于领域知识的知识发现研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
目前,知识发现的研究与应用是学术界和应用领域关注的热点,但是面对通常是领域相关的复杂应用环境,现有的知识发现研究在体系结构和算法方面尚未很好地融入领域知识,应用效果难如人意。如何从面向最终用户的角度出发,实现基于领域知识的知识发现,成为了现阶段知识发现研究和应用的焦点问题之一。随着面向服务的架构及语义网技术的发展,如何在面向服务的架构上实现知识发现应用也成为下一代知识发现系统研究领域内一个新的课题。因此本文主要对基于领域知识的知识发现展开讨论,具体在以下两个方面即:面向服务的、可融入领域知识的知识发现体系结构和基于领域知识的知识发现算法等进行了研究,取得了如下创新性研究成果:
     1)、提出了一个新的面向服务的知识发现体系结构SOA4KD,建立了一个知识发现服务的质量评价体系,提出了一个基于示例学习的元学习算法来保障知识发现服务的质量。
     2)、提出了一种基于领域本体服务,用户通过自然语言输入知识发现需求的方案及算法。首先对领域本体进行了分层细化,然后提出了一个扩展的知识发现任务本体,将可能的问题元素和知识发现任务联系起来,最后给出了一个语义解释算法,将用户以自然语言方式表达的需求转化为计算机可理解的形式化语言,并证明了该算法是可靠的和完整的。通过该方法,用户可以自助地在SOA4KD上实现知识发现应用。
     3)、提出一种基于语义距离的最近邻分类方法SDkNN。该方法基于领域本体服务计算语义距离,并将其应用到最近邻算法中,提高了分类性能。经过在UCI数据集以及实际应用数据集中验证,SDkNN的整体性能要优于传统方法,在数据不完整的情况下效果更为明显,实践证明SDkNN有很好的应用价值。
     4)、提出了一种基于本体服务的多层次意外分类规则发现算法。通过逐层知识推送和启发式规则判断意外性,可以有效提高算法效率和精度。实验证明该方法的整体性能要优于传统方法,并存实际应用中得到了验证。
     5)、提出了一种基于VSOM的两阶段神经网络模型,实验表明该模型可以克服传统RBFNN需要手工定义聚类个数和中心的不足,并可以解决数据分布不平衡对预测结果的影响。基于上述神经网络模型提出了一种融合领域知识的保险洪灾损失预测模型,该模型将基于DEM的地形等因子抽象出来,并融合当地的领域知识-洪灾风险图,实验证明该方法可以明显提高模型的泛化能力。
     6)、基于模型-控制-视图(MVC)设计模式实现了一个面向服务的知识发现系统原型。通过在保险公司内部进行应用,展现了该系统的应用潜力。
     最后,对全文进行总结,分析目前研究工作中有待完善的地方,同时指明了进一步研究的方向。
Currently, Research and application of Knowledge Discovery is a hot topic among scientific research and real application. But when facing the always domain-related complicated application environments, Current research fail to achieve satisfying results because they can not incorporate domain knowledge effectively. How to realize domain knowledge based knowledge discovery from enduser's prespective is a hot focus among scientific research and real application. With the rapid development of Service Oriented Architecture and Semantic Web, how to perform service oriented knowledge discovery is a key topic of next generation of knowledge discovery systems. So this thesis is focused on domain knowledge based knowledge discovery, more concretely, research work was carried out around service oriented knowledge discovery architecture and domain knowledge based knowledge discovery algorithms. Following innovative contributions were achieved:1. This thesis proposed a novel service oriented architecture for knowledge discovery-SOA4KD, which embrace a quality evaluation architecture for knowledge discovery service, and an IBL(Instance Based Learning) based meta-learning algorithm is proposed to ensure the quality of knowledge discovery services.2. Based on ontology services, this thesis proposed an approach to enable end-user to input their knowledge discovery requirements using natural language. To achieve this goal, the domain ontology is categorized in further detail, and then an extended knowledge discovery task ontology is proposed, which link the possible problem elements and knowledge discovery tasks. Finally, a semantic interpretation algorithm is proposed to transfer the user requirements expressed with natural language to computer-apprehensible formal language, and the algorithm is proved to be sound and complete. This approach enables end-user to self-build knowledge discovery application on SOA4KD.3. This thesis proposed a novel kNN approach based on semantic
    
    distance------SDkNN. This approach analysis the semantic difference between valuesof an attribute and presents how to calculate the semantic distance based on domain ontologies services, the semantic distance is then used to improve the traditional kNN methods. Experiments on UCI(University of California, Irvine) machine learning repository and real application datasets show that the overall performance of SDkNN outperforms the traditional one, especially when the data is incomplete. SDkNN also has the desirable application value in practice.4. This thesis proposed a method for discovering novel multi-level classification rules using ontology and domain knowledge service. Two heuristic rules are proposed for determining whether an attribute should be divided further and knowledge is pushed layer by layer. An algorithm for discovering unexpected multi-level classification rules using ontology is presented. Experiment shows the feasibility of the method.5. This thesis proposed a domain-knowledge based two-stage neural network model. The contribution is composed of two parts. Firstly, traditional RBFNN model need to determine the numbers and centers of neurons manually and the output of traditional model is always constrained by the intial distribution of source data and is unsatifying especially on unbalanced data. This thesis proposed a VSOM based two-stage neural network model. Experiments show this model can solve above problem effectively. Secondly, based on abovementioned model, an insurance loss prediction model incorprating domain knowledge is proposed, terrain factor based on Digital Elevation Model(DEM) is abstracted as input, an approach is proposed to incorporate local domain knowledge(i.e. flood risk map) with the output results, which improve the generation ability of the model. This is also proved by experiments.6. A prototype system of service oriented knowledge discovery system based on Model-View-Controller design pattern is implemented. The application potential is demonstrated by real application among insurance comp
引文
[Agra96] R.Agrawa], M.Mehta, J.Shafer, R.Srikant, A.Arning, and T.Bolliger. The Quest Data mining System. In: Procceedings of 1996 International Conference on Data Mining and Knowledge Discovery(KDD'96). Portland Oregon. August 1996: 244-249
    [Alex03] Alexander Yates, Oren Etzioni, Daniel S. Weld: A reliable natural language interface to household appliances. In: Proceedings of International Conference on Intelligent User Interfaces 2003: 189-196
    [Avra01] Avraham Leff, James T. Rayfield: Web-Application Development Using the Model/View/Controller Design Pattern. In: Proceedings of 5th IEEE Enterprise Distributed Object Computing Conference, Seattle, USA, September 2001.
    [Bere00] Bettina Berendt and M.Spiloipoulou. Analysing navigation behavior in web sites integrating multiple information systems. VLDB Journal: Special Issue on Databases and the Web, 2000 9(1): 56-75
    [Blak98] C.L. Blake and C.J. Merz. UCI Repository of machine learning databases. Irvine, CA: University of California, Department of Information and Computer Science, http://www.ics.uci.edu/—mlearn/MLRepository.html, 1998.
    [Bren05] Abraham Bernstein, Foster Provost, and Shawndra Hill. Towards Intelligent Assistance for a Data Mining Process: An Ontology-Based Approach for Cost-Sensitive Classification. IEEE Transactions on Knowledge and Data Engeneering, Vol 17, No. 4, 2005:503-518.
    [Brez03] Peter Brezany, J urgen Hofer, A Min Tjoa, and Alexander Wohrer. Towards an Open Service Architecture for Data Mining on the Grid. In: Proceedings of the International Conference on Database and Expert Systems Applications, Prague, September 2003.
    [Cai 91] Cai Y., Cercone N., Han J. Attribute-oriented induction in relational databases. In: Piatetsky-Shapiro, Frawley W.J. eds. Knowledge Discovery in Databases. Menlo Park, California: AAAI Press/The MIT Press, 1991: 213-228
    [Cann03] M. Cannataro and C. Comito. A Data Mining Ontology for Grid Programming. In: proceedings of 1st International Workshop on Semantics in Peer-to-Peer and Grid Computing (SemPGrid2003), May 2003: 113-134.
    [Cann04] Mario Cannataro, Domenico Talia, Semantics and Knowledge Grids: Building the Next-Generation Grid, IEEE Intelligent System, 2004 January/February, pp 56-63
    [Cart92] Carter C L, Hamilton H J, A Fast, On-Line Generalization Algorithm for Knowledge Discovery. In: J. Association of Computing Machinery, 1992, 19(2): 244-247
    
    [Casa02] Casati, F.,Dayal, U., Sayal,. M., and Shan, M. Business process Intelligence, Hewlett-Packard Labs. Technical Report: HPL-2002-119,2002.
    [Chan01] F.-J. Chang. Jin-Ming Liang, Yen-Chang Chen, Flood forecasting using radial basis function neural networks. IEEE Transactions on Systems, Man, and Cybernetics, Part C. 2001, 31(4): 530~535
    [陈99] 陈秀万.洪水灾害损失评估系统—遥感与GIS技术应用研究.中国水利水电出版社.1999.2
    [程00] 程继华,郭建生,施鹏飞.挖掘所关注规则的多策略方法研究.计算机学报,23(1),2000:45~51
    [Cody02] W.F. Cody, J.T, Kerulen, V.Krishna and W.S.Spangler, The integration of Business Intelligence and Knowledge Management, IBM System Journal, Vol 41, No 4, 2002
    [Cure02] V. Curcin, M. Ghanem, Y. Guo, M. Kohler, A. Rowe, J. Syed, P. Wendel. Discovery Net: Towards a Grid of Knowledge Discovery. In: Proceedings of The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Edmonton, Alberta, Canada. July 23~26, 2002.
    [Davi94] David Wai-Lok Cheung, A. W.-C. Fu, Jiawei Han: Knowledge Discovery in Databases: A Rule-Based Attributed-Oriented approach. In: Proceedings Of 1994 International Symposium On Methodologies for Intelligent Systems(ISMIS'94), Charlotte, North Carolina, October 1994, 164~173
    [Daws01] Dawson, CW and Wilby, RL, Hydrological modelling using artificial neural networks, Progress in. Physical Geography, 25(1), 2001:80~108.
    [Delg01] Delgado, M.Snchez, D. Martn-Bautista, M.J.Vila, M.A. Mining association rules with improved semantics in medical databases. Artificial Intelligence in Medicine. 2001, 21:241~245.
    [Fayy96a] U.M.Fayyad, G.Piatetsky-Shapiro, and P.Smyth. Knowledge Discovery and data mining: Towards a unifying framework. In: Proceedings of Second International Conference on Knowledge Discovery and Data Mining, Meolo Park, CA, AAAI Press, 1996, 82~88.
    [Fayy96b] U.M.Fayyad, G.Piatetsky-Shapiro, P.Smyth and R.Uthurusamy, Advances in Knowledge Discovery and Data mining, MIT Press, 1996.
    [Fayy02] Fayyad, Usama, Ramasamy Uthurusamy, Evolving Data Mining into Solutions for Insights, Communications of the ACM, 45 (8), August 2002.
    [冯01] 冯平,崔广涛,钟昀.城市洪涝灾害直接经济损失的评估和预测.水利学报.2001(8):64~68.
    [Frei02] Freitas,A.A., Data mining and knowledge discovery with evolutionary algorithms, Springer-Verlag. 2002
    [Gian03] Giannadakis et ai. InfoGrid: Providing Information Integration for Knowledge Discovery. In: Journal of Information Science 2003:199~226
    [Giuf00] Giovarmi Giuffrida, Wesley W. Chu, Dominique M. Hanssens, Mining Classification Rules from Datasets with Large Number of Many-Valued Attributes, In: Proceedings of 12th International Conference on Extending Database Technology, Konstanz, Germany, March, 2000
    [Grub93] Gruber T R. A translation approach to portable ontology specification. Knowledge Acquisition,. 5(2). 1993:199~220.
    
    [Grub95] Gruber T. Towards principles for the design of ontologies used for knowledge sharing[J]. International Journal of Human-Computer Studies 43 (5/6): 907-928, 1995
    [郭01] 郭建生,赵奕,施鹏飞.一种有效的用于数据挖掘的动态概念聚类算法,软件学报,21(4),2001:582~591
    [Han01a] Jiawei Han, Micheline Kambr. Data Mining Concepts and Techniques. Morgan Kaufman Publishers, 2001
    [Han01b] Eui-Hong Han, George Karypis, Vipin Kumar, Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification, In: D. Cheung, G.J. Williams, Q. Li. Eds. Advances in Knowledge Discovery and Data Mining: 5th Pacific-Asia Conference, PAKDD. Springer-Verlag, 2001.53~65.
    [Han93] J.Han, Y.Cai and N.Cercone. Data-driven discovery of quantitative rules in relational databases. IEEE Trans on Knowledge and Data Engineering, vol 5, 1993: 29~40,
    [Han94a] J Han. Toward efficient induction mechanisms in database systems. Theoretical Computer Science, 1994, 133(1): 161~185
    [Han94b] J Han, Y Fu. Dynamic generation and refinement of concept hierarchies for knowledge discovery in databases. In: Fayyad U M, Uthurusamy R. eds. Proc. Of the KDD'94: the AAAI'94 Workshop on Knowledge Discovery in Databases. Menlo Park, CA: AAAI/MIT Press, 1994. 157~168
    [Han96a] J Han, Y Fu. Exploration of the power of attribute-oriented induction in data mining. In: Fayyad U M. et.al, eds. Advances in Knowledge Discovery and Data Mining. Menlo Park, CA: AAAI/MIT Press, 1996. 399~421
    [Han96b] Jiawei Han, et.al. DBMiner: A System for Mining Knowledge in Large Relational Database. In: Proceedings of 1996 International Conference on Data Mining and Knowledge Discovery(KDD'96). Pages 244~249, Portland Oregon. August 1996.
    [Han96c] J. Han, Y. Fu, K. Koperski, W. Wang, and O. Zaane, DMQL: a data mining query language for relational databases, In: Proceedings of SIGMOD'96 Workshop on Research Issues on Data Mining and Knowledge Discovery, Canada, 1996, 27~33.
    [Han99] J. Han, V. S. Lakshmanan, and R. T. Ng. Constraint-based multidimensional data mining. IEEE Computer, 32(8):46-50, 1999.
    [Hipp02] J. Hipp and U. Guntzer. Is pushing constraints deeply into the mining algorithms really what we want? ACM SIGKDD Explorations, 4(1):50~55, 2002
    [Holt99] Holt, A, G. L. Benwell. Applying case-based reasoning techniques in GIS, International Journal of Geographical Information Science, 1999, 13(1):9~25.
    [Hsu95] Hsu, K.-l., Gupta, H. V., Sorooshian, S. Artificial neural network modeling of the rainfall-runoff process. Water Resources Research, 1995, 31 (10):2517~2530
    [胡04] 胡春明,怀进鹏,孙海龙.基于Web服务的网格体系结构及其支撑环境 研究.软件学报,2004:15(7):1064~1073.
    
    [Huan95] Huang Y. Fu W C. Efficient algorithms for attribute-oriented induction. In: Fayyad U M, Uthurusamy R. eds. Proc. Of the 1st Int. Conf. on Knowledge Discovery and Data Mining. Menlo Park, CA: AAAI Press. 1995. 168~173
    [Hwan04] Sungsoon Hwang. Using Formal Ontology for Integrated Spatial Data Mining. In A.Lagana et al. (Eds.) Proceedings of 2004 International Conference on Computational Science and its Applications (ICCSA 2004 ), LNCS 3044, Springer-Verlag Berlin Heidelberg 2004: 1026~1035
    [Jang93] J.S.R Jang and C.T.Sun. Functional equivalence between radial basis function and fuzzy inference systems. IEEE Transactions Neural Networks, 1993, 4(1): 156~159
    [Kalo01] Kalogeraki, V., Pruyne,J., and Moorsel, A. A peer-to-peer architecture for delivering e-services. Hewlett-Paekard Labs. Technical Report: HPL-2001-181,2001.
    [Karg04] H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha. Data Mining: Next Generation Challenges and Future Directions. MIT/AAAI Press. 2004
    [Krem04] Stefan Kremer, Stefan Smolnik, Lutz Kolbe. Towards Knowledge Discovery through Context Explication. In: Ralph H, Sprague Jr, eds. Proc. of the 37th Hawaii International Conferenee on System Sciences, Hawaii: IEEE Computer Society, 2004.
    [Kris00] S.Krishnaswamy, A.Zaslavsky, S.W.Loke, An Architecture to Support Distributed Data mining Services in E-Commerce Environments. Workshop on Advanced Issues of E-Commerce and Web based Information Systems, 2000.
    [Kris03] S.Krishnaswarny, A Hybrid Model for Delivering Internet-based Distributed Data Mining Services, [PhD dissertation], Monash University, 2003
    [Kurg05] Kurgan, L., & Cios, K.J. CAIM Diseretization Algorithm, IEEE Transactions of Knowledge and Data Engineering, accepted, http://isl.cudenver.edu/lkurgan/publications.htm.,2005
    [Lear96] O'Leary D E. Using AI in Knowledge Management: Knowledge Bases and Ontologies. IEEE Intelligent Systems, 1996:34~39
    [Li03] Baoli Li, Shiwen Yu, Qin Lu. An Improved k-Nearest Neighbor Algorithm for Text Categorization, In: Maosong Sun, Tianshun Yao, Chunfa Yuan, eds., Advances in Computation of Oriental Languages, Proc. of the 20th International Conference on Computer Processing of Oriental Languages, Shenyang, China: Tsinghua University Press, 2003
    [Liu00] Liu, J.J Kwok, J.T. An extended genetic rule induction algorithm. In: Proceedings of the Congress on Evolutionary Computation, La Jolla, California, USA, 2000, 458~463.
    [Liu99] B. Liu, W. Hsu, L. F. Mun, and H. Y. Lee, Finding interesting patterns using user expectation[J]. IEEE Transactions on Knowledge and Data Engineering, vol. 11, pp. 817-832, 1999
    [李02] 李德毅,数据挖掘研究现状,数据仓库之路 http://www.dwway.com.2002.
    
    [李03] 李士雨,杨鹏,张润梅.海河流域水文预报系统软件开发.计算机辅助设计及图形学学报.2003.15(3):368~371
    [李04] 李善平,尹奇华,胡玉杰等.本体论研究综述.计算机研究与发展,2004,41(7):1041~1052
    [Li04] Yuhua Li. Zhengding Lu, Ontology-Based Universal Knowledge Grid: Enabling Knowledge Discovery and Integration on the Grid, In: Proceedings of 2004 IEEE International Conference on Services Computing (SCC'04) Shanghai, China, September 15-18, 2004:557~560
    [梁04] 梁吉业,王俊红.基于概念格的规则产生集挖掘算法.计算机研究与发展,2004,41(8):1339~1344
    [Lu98] Y.Lu. Concept Hierarchies in Data Mining: Specification, Generation and Application. M.Sc. Thesis, Simon Fraser University, Canada, January 1998
    [Lu00] H. Lu, H. Liu, Decision Tables: Scalable Classification Exploring RDBMS Capabilities, In: Proceedings of 26th International Conference on Very Large Data Bases, Cairo, Egypt, September 10-14, 2000.
    [Math93] C.J.Matheus, P.K.Chan and G.P.Shapiro, Systems for Knowledge Discovery in Databases. In IEEE TKDE Special Issue on Learning & Discovery in Knowledge-Based Databases, 1993.
    [Meer97] R.Meersman. An Essay on The Role and Evolution of Data(base) Semantics. In: R.Meersman and L.Mark (eds.) DataBase Application Semantics. Chapman Hall, London, UK, 1997.
    [Mide01] Herman Midelfart, Astrid Lagreid, and Jan Komorowski. Classification of Gene Expression Data in an Ontology. In: J.Crespo, V.Maojo, and F. Martin (Eds.): Proceedings of International Symposium on Medical Data Analysis (ISMDA 2001), LNCS 2199, Springer-Verlag Berlin Heidelberg 2001: 186~194
    [Nasu01] T.Nasukawa and T.Nagano, Text Analysis and Knowledge Mining System, IBM Systems Journal, Vol 40, No 4, 2001.
    [Nech91] Nechies R, Fikes R, Finin T, et al. Enabling technology for knowledge sharing. AI Magazine, 1991, 12(3): 36~56.
    [Oja99] Oja,E., and Kaski,S.: Kohonen Maps. Amsterdam, Elsevier, 1999.
    [OMG01] Object Management Group: Common Warehouse Metamodel (CWM) Specification, OGM Press, Needham, Massachusetts, 2001.
    [Padh01] Padhraic Smyth. Breaking Out of the Black-Box: Research Challenges in Data Mining. In: Proceedings of International Workshop on Data Mining and Knowledge Discovery, 2001:2~7
    [Padm99] B. Padmanabhan and A. Tuzhilin. Unexpectedness as a measure of interestingness in knowledge discovery. In Decision Support Systems, volume 27, Elsevier Science, 1999:303~318.
    [Pal02] S.K.Pal and P.Mitra. Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions. IEEE Transactions on Neural Network, Vol. 13, No 5, 2002.
    [Pare00] R.Paredes and E.Vidal. A nearest neighbor weighted measure in classification problems. In: A.Sanfeliu M.I. Torres, eds., Pattern Recognition and Applications. Frontiers in Artificial Intelligence and Applications. IOS Press, 2000. 44-50.
    
    
    [Pave03] Pavel B. Brazdil . Carlos Soares . Joaquim Pinto Da Costa. Ranking learning algorithms: using 1BL and meta-learning on accuracy and time Results, Machine Learning, Vol.50 No.3. March 2003:251-277.
    [Pei00] Pei J, Mao R, Hu K, and Zhu H, Towards Data Mining Benchmarking: A Test Bed for Performance Study of Frequent Pattern Mining, In: Proceedings. Of 2000 ACM-SIGMOD International Conference on Management of Data, Dallas, TX, May 2000
    [Pena01] J.M.Pena and E.Menasalvas. Towards Flexibility in a Distributed Mining Framework. In: Proceedings of International Workshop on Data Mining and Knowledge Discovery, 2001: 11—14
    [Piat99] GPiatetesky, C.Shapiro, J.Matheus, Knowledge Discovery in Databases: An Overview. In Piatetsky-Shapiro, W.J.Frawley eds. Knowledge Discovery in Databases. Menlo Park, California: AAAI Press/The MIT Press, 1999: 1-27.
    [Pohl02] Carten Pohle and Myra Spiliopoulou. Building and exploiting ad hoc concept hierarchies for web log analysis. In Yahiko Kambayashi, Werner Wininwarter, and Masatoshi Arikawa, eds, Proc. Of the 4th Int'l Conf. on Data Warehousing and Knowledge Discovery, France, Springer-Verlag, 2002: 83-93
    [Pohl03] Carsten Pohle, Integrating and Updating Domain Knowledge in Data Mining, In: Proceedings Of the VLDB 2003 PhD Workshop, Berlin, Germany
    [Quin97] Quinn P F, Beven K J, Lamb R. The ln(a/tanp) index: how to calculate it and how to use it within the TOPMODEL framework. In: Beven K J(Editor), Distributed Hydrological Modeling: Application of TOPMODEL Concept. John Wiley & Sons Ltd, 1997, 31-52.
    [Rodr00] M. Andrea Rodriguez, Assessing Semantic Similarity Among Spatial Entity Classes, [Ph.D. Thesis], Department of Spatial Information Science and Engineering, University of Maine Orono, 2000.
    [Shap00] Gregory Piatetsky-Shapiro, Knowledge Discovery in Database: 10 years after, SIGKDD Exploration, ACM SIGKDD, 2000:1 (1):59~61
    [Shai03] Shaikhali, A., Rana, O., Al-Ali, R., Walker, D. UDDIe: an extended registry for web services. In: Proceedings of Workhshop on Service Oriented Computing: Models, Architectures and Applications at SAINT Conference, Florida, 2003
    [史 02] 史忠植. 知识发现. 清华大学出版社,北京, 2002
    [Silv98] Silverstein, A. Brin, S.Motwani,R. Beyond market baskets: Generalizing association rules to dependence rules. Data Mining and Knowledge Discovery, 1998,2:29-68.
    [Staa03] S. Staab, A. Hotho. Ontology-based Text Document Clustering. In: Mieczyslaw A. Klopotek, Slawomir T. Wierzchon and Krzysztof Trojanowski, Eds. Proceedings, of the International Conference on Intelligent Information
     Systems, Zakopane, Poland: Springer Verlag. 2003.
    
    [Tan02] Tan. P.Kumar. V.Srivastava.J. Selecting the right interestingness measures for association patterns. In: Proceedings. of the 8th International Conference on Knowledge Discovery and Data Mining.Canada.2002.32~41.
    [Tayl97] Merwyn Taylor, Kilian Stoffel, and James Hendler. Ontology-based Induction of High Level Classification Rules. In: Proc. of ACM SIGMOD Data Mining and Knowledge Discovery workshop. Tuscon, Arizona, 1997.
    [王03] 王大玲,于戈等,一种面向数据挖掘预处理过程的领域知识的分类及表示,小型微型计算机系统,2003年5月,863~868
    [Wu03] Zhaohui Wu, Huajun Chen, Jiefeng Xu: Knowledge Base Grid: A Generic Grid Architecture for Semantic Web. Journal of Computer Science & Technology. 18(4): 2003, 462~473
    [Xiao96] Ma Xiaoqiang. Survey on Data Warehouse. Technical Report. HongKong University. 1996
    [Xie02] Y. Xie and V.V Raghavan. Probabilistic logic-based characterization of knowledge discovery in databases. In: Proceedings of ICDM02 Workshop on Foundation of Data Mining and Knowledge Discovery, Japan,2002,107~112
    [杨04] 杨立,左春,王裕国.一种基于语义距离的最近邻分类方法.软件学报,2004,已录用.
    [Zaia98] O.R. Zaiance, M.Xin and J.Han, Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs, Advances in Digital Libraries, 1998.
    [Zaki02] Mohammed J. Zaki, Yi Pan. Introduction: Recent Developments in Parallel and Distributed Data Mining. Kluwer Academic Publishers. 2002
    [Zeng03] Liangzhao Zeng, Boualem Benatallah and Marion Dumas, Quality driven web services composition, In: Proceedings of WWW2003, Hungary, 2003.
    [Zhan02] Zhang, J., Silvescu, A., and Honavar, V. Ontology-Driven Induction of Decision Trees at Multiple Levels of Abstraction. In: Proceedings of Symposium on Abstraction, Reformulation, and Approximation. Alberta, Canada. 2002:316~323
    [Zhan03] Zhang, J. and Honavar, V. Learning decision tree classifiers from attribute value taxonomies and partially specified data. In: Proceedings of the Twentieth International Conference on Machine Learning (ICML-03). Washington, DC. 2003.
    [Zhan04] Zhang, J.and Honavar, V. AVT-NBL: An Algorithm for Learning Compact and Accurate Naive Bayes Classifiers from Attribute Value Taxonomies and Data. In: Proceedings of the Fourth IEEE International Conference on Data Mining (lCDM'04).
    [张04] 张晶.基于领域知识和概念格模型的知识发现研究.[硕士学位论文].合肥工业大学计算机系.2004
    [Zhug04] H.Zhuge, China E-Science Knowledge Grid, IEEE Intelligent System, 2004 January/February, pp 13~17.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700