用户名: 密码: 验证码:
基于文本的领域本体学习方法及其应用研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
领域本体是某一领域(或某一应用)的共享概念模型的形式化的明确说明。目前,领域本体已经被广泛应用于知识管理、语义服务、电子商务、人工智能等多个领域。但是,构建领域本体是一项困难的任务,人工构建费时费力。为此,相关应用领域开始热研能够支持本体(半)自动构建的本体学习方法,即,从数据源中(半)自动地提取本体对象以支持本体构建,提高构建效率并优化结果的机器学习方法。
     在(半)自动构建领域本体的过程中,主要存在三个问题:提取文档词语、构建领域概念集合和构建本体关系集合。相应地,本文研究了三个本体学习方法,用于从中文文本语料中半自动地提取本体对象,简化中文领域本体的构建过程:
     1)提出一种新的词语提取方法——原子词步长法。该方法结合原子词词性分析与串频统计来判断汉字串是否成词并建立词语集合,有效地解决了从文档中自动提取所包含的中文词语集合的问题。
     2)提出一种新的领域概念学习方法——领域隶属同义词分析法。该方法由两个子方法组合而成:领域隶属度分析方法和同义词合并方法,前者用于提取领域专有术语集合,后者用于消除术语集合中的同义现象。在给定合适的文本语料库的情况下,该方法能够解决领域概念集合的构建问题。
     3)提出一种新的本体关系学习方法—概念特征词法。该方法基于概念的特征词模型来计算两个概念之间的相关程度,学习非类属关系。合成既有的类属关系学习方法后,该方法能够有效支持本体关系集合的构建。
     上述三个中文领域本体的学习方法均在中文语料上做过多次试验、性能分析和算法改良,学习结果令人满意。并且,这些方法被综合使用到一个国家自然基金委项目(即信息管理和知识管理领域的术语标准化)中,得到了实际应用和验证。
     本文提出的基于文本的领域本体学习方法具有较高的性能和较强的实用性。在给定合适的文本语料的情况下,采用这些方法可以以人机结合的方式构建面向应用的中文领域本体。实现了构建过程中一定程度的自动化,简化了本体构建任务,从而起到促进本体的产业化发展的作用。经过适当的调整和整合之后,这些本体学习方法还可应用于语义检索、文本摘要等其他诸多领域。
A domain ontology is a formal and explicit specification of a shared conceptualization of a specific domain (or an application). There is an increasing need for domain ontologies in various areas, such as knowledge management, semantic service, e-commerce, artificial intelligence, etc. However, building a domain ontology is a labor-intensive and time-consuming task. In recent years, ontology learning approaches, which use machine learning technologies to (semi-)automatically extract relevant concepts and relations from data sources to form an ontology, have been proposed to support the ontology building process.
     In the (semi-)automatic building process of a domain ontology, there existes 3 main problems, including a) extracting terms from electronic documents, b) building the domain ontology concept set, and c) buiding the ontology relation set. Accordingly, the dissertation proposes 3 ontology learning methods and approaches to offer computational support for semi-automatically building Chinese domain ontologies.
     1) It proposes a new Chinese term extraction method, the by-step-of-atomic-word method. This method combines POS analysis and string frequency statistics to determine whether a Chinese string in a document is a term or not and collect all terms occurring in the document. It is a practical solution to the problem of extracting Chinese terms from electronic documents.
     2) It proposes a new approach of learning the domain ontology concept set from Chinese text corpora. The approach is composed of two methods:the DMD (domain membership degree) anaylsis method for extracting domain-specific terms and a new synonym merging method for eliminating the synonymous terms. Given proper text corpora, it solves the problem of building the domain ontology concept set.
     3) It proposes a new ontology relation learning method, the concept-feature-based method. The method suggests ontology relations, especially the non-taxonomic ones, by analyzing the relevance of features of two concepts. Through combining the established string-inclusion method for learning taxonomic relations, it efficiently supports the building process of the ontology relation set.
     Each of the proposed methods and approaches have been tested on various types of Chinese text corpora and modified for tens of times. They offer much better performance when learning Chinese domain ontologies than currently existing approaches typically do. They have been used and verified in the project of terminology standardization of the information & knowledge management field, which is sponsored by CNCTST and NSFC.
     The methods and approaches proposed in the dissertation are practical and efficient. Given proper text corpora, one can build his/her Chinese domain ontology using the approaches and then build ontology-based applications. The approaches support the semi-automatically building of Chinese domain ontologies, simplify the building process, in that way, promote the industrialization of ontologies. If changed slightly, the methods and approaches may also be used in many other areas, such as semantic retrieval, text summarization, etc.
引文
[1]Dieter Fensel. Ontologies:Silver Bullet for Knowledge Management and Electronic Commerce [M]. Berlin:Springer-Verlag,2000.
    [2]Gruber T. R. A Translation Approach to Portable Ontology Specifications [J]. Knowledge Acquisition,1993,5(2):199-220.
    [3]Rudi Studer, V. Richard Benjamins, Dieter Fensel. Knowledge Engineering: Principles and Methods [J]. Data and Knowledge Engineering,1998,25(1-2):161-197.
    [4]WordNet. http://wordnet.princeton.edu/, (Accessed on Oct 8,2006).
    [5]HowNet. http://www.keenage.com/, (Accessed on Oct 8,2006).
    [6]Aldo Gangemi, Stephan Grimm, Peter Mika, et al. Core Ontology of Software-Software Components-Services [EB/OL]. http://cos.ontoware.org/, (Accessed on Oct 13, 2006).
    [7]Robert Stevens, Carole A. Goble, Sean Bechhofer. Ontology-based Knowledge Representation for Bioinformatics [J]. Briefings in Bioinformatics,2000,1(4): 398-414.
    [8]Mike Uschold, Michael Gruninger. ONTOLOGIES:Principles, Methods and Applications [J]. Knowledge Engineering Review,1996,11(2),1996.6.
    [9]V. R. Benjamins, B. Chandrasekaran, A. Gomez-Perez, et al. A Framework for Understanding and Classifying Ontology Applications [C]. Proceedings of the IJCAI-99 workshop on Ontologies and Problem-Solving Methods. Stockholm, Sweden,1999.8.
    [10]于娟.基于本体语言OWL的知识表示及推理算法研究[D].青岛大学,2006.6.
    [11]刘柏嵩.基于本体的知识管理关键技术研究[J].情报学报,2005,24(2):76-81.
    [12]York Sure, Pascal Hitzler, Andreas Eberhart. The Semantic Web in One Day [J]. IEEE Intelligent System,2005,20(3):85-87.
    [13]王众托.知识系统工程[M].北京:科学出版社,2006.2:148-151.
    [14]W3C. Semantic Web Activity Statement [EB/OL]. http://www.w3.org/2001/se/Activity, (Accessed on Dec.02,2004).
    [15]Tim Berners-Lee. Semantic Web-XML2000 [EB/OL]. http://www.w3.org/2000/Talks/ 1206-xml2k-tbl/, (Accessed on Dec.02,2004).
    [16]Edd Dumbill. Berners-Lee and the Semantic Web Vision [EB/OL]. http://www.xml.c om/pub/a/2000/12/xml2000/timbl.html, (Accessed on Sep.26,2004).
    [17]Chin-liang Chang, Richard Char-tung Lee. Symbolic Logic and Mechanical Theorem Proving [M]. New York:Academic Press,1973.
    [18]Quillian M. R. Word Concepts:A Theory and Simulation of Some Basic Semantic Capabilities [J]. Behavioral Science,1967(12):410-430.
    [19]Minsky M. A Framework for Representing Knowledge [C]. In P. H. Winston (Ed.). The Psychology of Computer Vision. New York:McGraw-Hill,1975:211-277.
    [20]Sowa J. F. Conceptual Structures:Information Processing in Mind and Machine (The Systems Programming Series) [M]. Massachusetts:Addison-Wesley Publishing Company, 1984.
    [21]Lewis J. A., Luger G. F. A Constructivist Model of Robot Perception and Performance [C]. Proceeding of the 22nd Annual Conference of the Cognitive Science Society. Hillsdale, NJ:Erlbaum,2000.
    [22]Lakeoff G., Johnson M. Philosophy in the Flesh [M]. New York:Basic Books,1999.
    [23]George F. Luger.人工智能:复杂问题求解的结构和策略(第四版)[M].史忠植等译.北京:电子工业出版社,2004:143-155.
    [24]Vit Novacek. Ontology Learning [D]. 捷克马萨里克大学信息学院,2005.12:14-15.
    [25]Alexander Maedche, Steffen Staab. The TEXT-TO-ONTO Ontology Learning Environment [C]. Software Demonstration at ICCS-2000-Eight International Conference on Conceptual Structures. Darmstadt, Germany, August 15th,2000.
    [26]Borys Omelayenko. Learning of Ontologies for the Web:the Analysis of Existent Approaches [C]. Proceedings of the International Workshop on Web Dynamics, held in conj. with the 8th International Conference on Database Theory (ICDT'01), London, UK, Jan,2001.
    [27]Fernandez Mariano. Overview of Methodologies for Building Ontologies. Proceedings of IJCAI-99 Workshop on Ontologies and Problem-Solving Methods. Stockholm, Sweden, Aug.2,1999:4.1-4.13.
    [28]Matthew Horridge, Holger Knublauch, et al. A Practical Guide to Building OWL Ontology Using the Protege-OWL Plugin and CO-ODE Tools [EB/OL]. the University of Manchester,2004.8.
    [29]Michael K. Smith, Chris Welty, et al. OWL Web Ontology Language Guide [EB/OL]. http://www.w3.org/TR/2004/REC-owl-guide-20040210/, (Accessed on Jan.1,2005).
    [30]Mike Dean, Guus Schreiber. OWL Web Ontology Language Reference [EB/OL]. http://www.w3.org/TR/2004/REC-owl-ref-20040210/, (Accessed on Jan.1,2005).
    [31]SEKT. http://www.sekt-project.com/, (Accessed on Oct 16,2006).
    [32]SWAP. http://swap.semanticweb.org/, (Accessed on Oct 16,2006).
    [33]Paolo Bouquet, Marc Ehrig, Jerome Euzenat, et al. Specification of a Common Framework for Characterizing Alignment [EB/OL]. KnowledgeWeb deliverable D2.2. 1v2, 2004, http://www.aifb.uni-karlsruhe.de/WBS/phi/pub/kweb-221.pdf. (Accessed on Nov 13,2006).
    [34]Jos de Bruijn. Ontology Mediation [EB/OL]. http://www. keapro. net/sekt/m_ontolo gy_mediation.htm,2004, (Accessed on Oct 16,2006).
    [35]Jerome Euzenat, Thanh Le Bach, Jesus Barrasa, et al. D2.2.3:State of the Art on Ontology Alignment [EB/OL].2004. (Accessed on Nov 9,2006).
    [36]Ontology Matching. http://www.ontologymatching.org/, (Accessed on Nov 9,2006).
    [37]John F. Sowa. Building, Sharing, and Merging Ontologies [EB/OL]. Unpublished paper. 2001. http://www.jfsowa.com/ontology/ontoshar.htm. (Accessed on Nov 5,2006).
    [38]Jos de Bruijn, Marc Ehrig, Cristina Feier, et al. Ontology Mediation, Merging and Aligning [C]. In Davies J, Studer R, Warren P (eds), Semantic Web Technologies: Trends and Research in Ontology-based Systems, Wiley, UK,2006.
    [39]于娟,党延忠.本体集成研究综述[J].计算机科学,2008,35(7):9-13.
    [40]Marc Ehrig, York Sure. Ontology Mapping-An Integrated Approach [C]. Proceedings of the 1st European Semantic Web Symposium. Heraklionm Greece, springer, LNCS, 2004.5:10-12.
    [41]Martin Eklof, Christian Martenson. Ontological Interoperability [EB/OL]. http://www2.foi.se/rapp/foir1943.pdf,2006.1, (Accessed on Nov 28,2006).
    [42]黄烟波,张红宇,李建华,et al.本体映射方法研究[J].计算机工程与应用,2005(18):27-29.
    [43]Noy N. F., Musen M. A. PROMPT:Algorithm and Tool for Automated Ontology Merging and Alignment [C]. Proceedings of the 17th National Conference on Artificial Intelligence (AAAI2000), Austin, Texas, USA,2000.
    [44]Dejing Dou, Drew McDermott, Peishen Qi. Ontology Translation by Ontology Merging and Automated Reasoning [C]. Proceeding EKAW2002 Workshop on Ontologies for Multi-Agent Systems,2002:3-18.
    [45]Alexander Maedche, Boris Motik, Nu no Silva, et al. MAFRA-A MApping FRAmework for Distributed Ontologies [C]. Proceedings of the 13th European Conference on Knowledge Engineering and Knowledge Management EKAW-2002, Madrid, Spain,2002.
    [46]Doan, A., Madhaven, J., Domingos P. et al. Ontology Matching:A Machine Learning Approach [M], in S. Staab & R. Studer, eds, Handbook on Ontologies in Information Systems, Springer-Verlag,2004:397-416.
    [47]Hans-Peter Schnurr, Jurgen Angele. Do Not Use This Gear with a Switching Lever! Automotive Industry Experience with Semantic Guides [C].4th International Semantic Web Conference (ISWC2005). LNCS 3729,2005:1029-1040.
    [48]Asuncion Gomez-Perez. Evaluating Ontology Evaluation IEEE Intelligent Systems [J]. IEEE Intelligent Systems,2004,19(4):74-76.
    [49]Natalya F. Noy. Evaluation by Ontology Consumers [J]. IEEE Intelligent Systems. 2004,19(4):74-76.
    [50]马文峰,杜小勇.领域本体进化研究[J].图书情报工作,2006.6,50(6):71-75.
    [51]萨师炫,王珊.数据库管理系统(第三版)[M].北京:高等教育出版社.2000.2:4-5.
    [52]Database Management systems [EB/OL]. http://elearning. tvm. tcs. co. in/dbms/dbms/ Contents. htm. (Accessed on Nov 14,2008).
    [53]Date C. J. An Introduction to Database Systems (7th edition) [M], Addison-Wesley, 2000.
    [54]Nenad Stojanovic, Jens Hartmann, Jorge Gonzalez. The OntoManager-a System for the Usage-based Ontology Management [C]. Proceedings of FGML Worshop Special Interest Group of German Information Society,2003:858-875.
    [55]Ying Ding. Ontology Management System [EB/OL], http://sw-portal.deri.at/papers /deliverables/dl7_v01. pdf,2004.8.31. (Accessed on Dec 21,2006).
    [56]Robert Harrison, Christine W. Chan. Distributed Ontology Management System [C]. Canadian Conference on Electrical and Computer Engineering,2005.5.
    [57]Werner Ceusters, Peter Martens. LinKFactory:an Advanced Formal Ontology Mana gement System [EB/OL]. http://www.isi.edu/~blythe/kcap-interaction/papers/LinK FactoryXWhiteXPaperXfinal.doc. (Accessed on Dec.12,2006).
    [58]IBM Ontology Management System [EB/OL]. http://www.alphaworks.ibm.com/tech/sno base,2003. (Accessed on Dec.12,2006).
    [59]Ontoprise. OntoStudio [EB/OL]. http://ontoworld.org/wiki/OntoStudio. (Accessed on Dec.16,2006).
    [60]M. Missikoff, F. Taglino. SymOntoX:A Web-Ontology Tool for eBusiness Domains [C]. Proceedings of the Fourth International Conference on Web Information Systems Engineering (WISE'03),2003:343-347.
    [61]Robert Harrison, Christine W. Chan. Distributed Ontology Management System [C]. Canadian Conference on Electrical and Computer Engineering 2005 (CCECE'05), Saskatoon, Canada. May 1-4,2005:661-664.
    [62]Language and Computing, http://www.landcglobal.com/, (Accessed on Jul.12,2007).
    [63]魏哲雄,冯志勇.基于字典技术的本体整合系统[J].计算机应用,2007,27(2):428-430.
    [64]杜小勇,李曼,王珊.本体学习研究综述[J].软件学报,2006,17(9):1837-1847.
    [65]孔敬.本体学习:原理、方法与相关进展[J].情报学报,2006.12,25(6):657-665.
    [66]Alexander Maedche. Ontology Learning for the Semantic Web [D]. University of Karlscruhe, Boston:Kluwer Academic Publishers,2002.
    [67]Alexander Maedche, Steffen Staab. Semi-Automatic Engineering of Ontologies from Text [C], Proceedings of the 12th Internal Conference on Software and Knowledge Engineering, Chicago, USA,2000.
    [68]王淑华.意义组合原理及汉语中的“词”处理[J].宁夏大学学报(人文社会科学版),2007,29(3):15-20.
    [69]陈波.逻辑哲学原理[M].北京:北京大学出版社,2000:27.
    [70]杨梅.现代汉语合成词构词研究[D].南京师范大学,2006.
    [71]姜韶华,党延忠.基于长度递减与串频统计的文本切分算法[J].情报学报,2006,25(1):74-79.
    [72]姜韶华,党延忠,宣照国.无词典抽词的RMMFS和BMMFS方法及其比较研究[J].情报学报,2006,25(4):499-503.
    [73]Lee-Feng Chien. PAT-tree-based Keyword Extraction for Chinese Information Retrieval [C]. In Proceedings of the 20th annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, Pennsylvania, United States, July 27-31,1997.
    [74]付德宇,代成琴.一个面向文本分类的中文特征词自动抽取方法[J].计算机工程与应用,2006.15:165-167.
    [75]Sui Zhifang, Chen Yirong, Hu Junfeng, et al. The Research on the Automatic Term Extraction in the Domain of Information Science and Technology [C]. In Proceedings of the 5th East Asia Forum of the Terminology, Haikou, China. December,2002.
    [76]苑春法,黄昌宁.基于语素数据库的汉语语素及构词研究[J].语言文字应用,1998,(3):83-88.
    [77]ICTCLAS.汉语词法分析系统[EB/OL]. http://www. nlp. org. cn/project/project. php?pr oj_id=6. (Accessed on Nov 6,2007).
    [78]张华平,刘群.基于N-最短路径方法的中文词语粗分模型[J].中文信息学报,2002,(5):1-7.
    [79]IRLAS.哈尔滨工业大学信息检索实验室[EB/OL]. http://ir.hit.edu.cn/. (Accessed onNov 6,2007).
    [80]Agirre E, Ansa 0, Hovy E, Martinez D. Enriching very Large Ontologies Using the WWW [C]. Proceedings of the 1st Workshop on Ontology Learning. Berlin, Germany, 2000.
    [81]Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier. A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping [C]. Proceedings of the LREC, Las Palmas, Canary Island, Spain,2002.
    [82]Roberto Navigli, Paola Velardi, Aldo Gangemi. Ontology Learning and Its Application to Automated Terminology Translation [J]. IEEE Intelligent Systems,2003,18(1): 22-31.
    [83]陈文亮,朱靖波,姚天顺.基于Bootstrapping的领域词汇自动获取[C].第7届全国计算语言学联合学术会议论文集(JSCL 2003),北京:清华大学出版社,2003:67-72.
    [84]郑家恒,卢娇丽.关键词抽取方法的研究[J].计算机工程,2005,31(18):194-196.
    [85]程勇.基于本体的不确定性知识管理研究[D].中国科学院研究生院(计算技术研究所),2005.5.
    [86]何燕,穗志方,段慧明,et al.基于专业术语词典的自动领域本体构造[J].情报学报,2007,26(1):65-70.
    [87]Beatrice Daille. Study and Implementation of Combined Techniques for Automatic Extraction of Terminology [C]. In The Balancing Act:Combining Symbolic and Statistical Approaches to Language, Workshop at the 32nd Annual Meeting of ACL, Las Cruces, Nouveau Mexique,1994.
    [88]Paola Velardi, Paolo Fabriani, Michele Missikoff. Using Text Processing Techniques to Automatically Enrich a Domain Ontology [C]. Proceedings of the International Conference on Formal Ontology in Information Systems, Ogunquit, USA. New York:ACM Press,2001:270-284.
    [89]杜波,田怀凤,王立,等.基于多策略的专业领域术语抽取器的设计[J].计算机工程,2005,31(14):159-160.
    [90]张新,党延忠.基于规则与统计的本体概念自动获取方法研究[J].情报学报,2007,26(6): 813-820.
    [9l]Qun Liu, Sujian Li. Word Similarity Computing Based on How-net [C]. The 3rd Chinese Lexical Semantics Workshop (CLSW'02), Taipei, May 2002.
    [92]Miyoung Cho, Hanil Kim, Pankoo Kim, et al. A New Method for Ontology Merging based on Concept using WordNet [C]. Advanced Communication Technology. ICACT'06. The 8th International Conference,2006.2:1573-1576.
    [93]Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora [C]. Proceedings of the 14th International Conference on Computational Linguistics (Coling-92), Nantes, France, Aug.23-28,1992:539-545.
    [94]Luc ja Iwanska, Naveen Mata, Kellyn Kruger. Fully Automatic Acquisition of Taxonomic Knowledge from Large Corpora of Texts:Limited Syntax Knowledge Representation System Based on Natural Language [C]. Natural Language Processing and Knowledge Processing, MIT/AAAI Press,2000:335-345.
    [95]Khurshid Ahmad, Mariam Tariq, Bogdan Vrusias, et al. Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains [C]. Proceedings of the 25th European Conference on Advances in Information Retrieval (ECIR), Pisa, Italy, April 14-16,2003:502-510.
    [96]Philipp Cimiano, Siegfried Handschuh, Steffen Staab. Towards the Self-Annotating Web [C]. Proceedings of the 13th World Wide Web Conference (WWW2004), New York, USA, May 17-22,2004:462-471.
    [97]Oren Etzioni, Michael Cafarella, Doug Downey, et al. Web-Scale Information Extraction in KnowItAll (Preliminary Results) [C]. Proceedings of the 13th World Wide Web Conference (WWW2004), New York, USA, May 17-22,2004:100-109.
    [98]David Sanchez, Antonio Moreno. Pattern-based Automatic Taxonomy Learning from the Web [J]. AI Communications.2008,21(1):27-48.
    [99]Katja Markert, Malvina Nissim, Natalia N. Modjeska. Using the Web for Nominal Anaphora Resolution [C]. EACL Workshop on the Computational Treatment of Anaphora, 2003.
    [100]Bernhard Ganter, Rudolf Wille. Formal Concept Analysis [M]. Mathematical Foundation, Berlin:Springer Verlag.1999.
    [101]Marek Obitko, Vaclav Snasel, Jan Smid. Ontology Design with Formal Concept Analysis [C]. Proceedings of CLA 2004 International Workshop on Concept Lattices and their Applications Ostrava, Czech Republic,2004:111-119.
    [102]Philipp Cimiano, Andreas Hotho, Steffen Staab. Comparing Conceptual, Divisive and Agglomerative Clustering for Learning Taxonomies from Text [C]. Proceedings of the European Conference on Artificial Intelligence (ECAI), Valencia, Spain, August 23-27,2004:435-439.
    [103]Philipp Cimiano, Andreas Hotho, Steffen Staab. Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis [J]. Journal of Artificial Intelligence Research,2005,24:305-339.
    [104]Douglas H. Fisher. Knowledge Acquisition Via Incremental Conceptual Clustering [J]. Machine Learning,1987,2(2):139-172.
    [105]Marie-Laure Reinberger, Peter Spyns. Unsupervised Text Mining for the Learning of DOGMA-inspired Ontologies [C]. Ontology Learning from Text:Methods, Applications and Evaluation. IOS Press,2005.
    [106]Mark Sanderson, Bruce Croft. Deriving Concept Hierarchies from Text [C]. Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information (SIGIR99), Berkeley, California, USA, August 16-9,1999: 206-213.
    [107]Ido Dagan, Shaul Marcus, Shaul Markovitch. Contextual Word Similarity and Estimation from Sparse Data [C]. Proceedings of the 31st annual meeting on Association for Computational Linguistics (ACL), Columbus, Ohio, June 22-26,1993: 164-171.
    [108]Saif Mohammad, Graeme Hirst. Distributional Measures as Proxies for Semantic Relatedness [EB/OL]. http://ftp.cs. toronto. edu/pub/gh/Mohammad+Hirst-2005.pdf, (Accessed on Jan 27,2010).
    [109]陈浪舟,黄泰翼.一种新颖的词聚类算法和可变长统计语言模型[J].计算机学报,1999,22(9):942-948.
    [110]胡和平,曾庆锐,路送峰.中文词聚类研究[J].计算机工程与科学,2006,28(1):122-124.
    [111]赵军,胡栓柱,樊兴华.一种新的词语相似度计算方法[J].重庆邮电大学学报(自然科学版),2009,21(4):528-532.
    [112]温春,石昭祥,张亮.中文领域本体概念层次获取方法对比研究[J].计算机应用研究,2009.8,26(8):2847-2850.
    [113]袁里驰.基于相似度的词聚类算法和可变长语言模型[J].小型微型计算机系统,2009,30(5):912-915.
    [114]Christopher Brewster, Simon Jupp, Joanne Luciano, et al. Issues in Learning an Ontology from Text [C]. BMC Bioinformatics, Published online 2009 May 6, Suppl 5:S1.
    [115]Philipp Cimiano, Aleksander Pivk, Lars Schmidt-Thieme, et al. Learning Taxonomic Relations from Heterogeneous Sources of Evidence [C]. Ontology Learning from Text:Methods, Evaluation and Applications, IOS Press,2005.7:59-73.
    [116]Sougata Mukherjea, Saurav Sahay. Discovering Semantic Biomedical Relations Utilizing the Web [C]. Pacific Symposium on Biocomputing, Wailea, Maui, Hawaii, USA. January 3-7,2006.
    [117]Saurav Sahay, Sougata Mukherjea, Eugene Agichtein, et al. Discovering Semantic Biomedical Relations Utilizing the Web [J]. ACM Transactions on Knowledge Discovery from Data (TKDD), March 2008,2(1):164-175.
    [118]Martin Kavalec, Alexander Maedche, Vojtech Svatek. Discovery of Lexical Entries for Non-taxonomic Relations in Ontology Learning [C], Proceedings of SOFSEM 2004:Theory and Practice of Computer Science, Lecture Notes in Computer Science, 2004,2932:249-256.
    [119]Peter Wiemer-Hastings, Arthur C. Graesser, Katja Wiemer-Hastings. Inferring the Meaning of Verbs from Context [C]. Proceedings of the 20th Annual Conference of the Cognitive Science Society, Erlbaum, Mahwah, NJ,1998:1142-1147.
    [120]Massimiliano Ciaramita, Aldo Gangemi, Esther Ratsch, et al. Unsupervised Learning of Semantic Relations between Concepts of a Molecular Biology Ontology [C]. Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI 2005), Edinburgh, Scotland, UK,2005:659-664.
    [121]David Sanchez, Antonio Moreno. Learning Non-taxonomic Relationships from Web Documents for Domain Ontology Construction [J]. Data & Knowledge Engineering, 2008.3,64(3):600-623.
    [122]Takahira Yamaguchi. Acquiring Conceptual Relationships from Domain-Specific Texts [C]. Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI 2001) Workshop on Ontology Learning, Seattle, Washington, USA, 2001.
    [123]Tao Jiang, Ah-Hwee Tan, Ke Wang. Mining Generalized Associations of Semantic Relations from Textual Web Content [J]. IEEE Transactions on Knowledge and Data Engineering,2007.2,19(2):164-179.
    [124]Tian Fang, Jiang Peilin, Ren Fuji. A Practical System of Domain Ontology Learning Using the Web for Chinese [C],2009 Fourth International Conference on Internet and Web Applications and Services, Venice/Mestre, Italy, May 24-28,2009: 298-303.
    [125]J. Villaverde, A. Persson, D. Godoy, A. Amandi. Supporting the Discovery and Labeling of Non-taxonomic Relationships in Ontology Learning [J]. Expert Systems with Applications,2009.9,36(7):10288-10294.
    [126]Johanna Volker, Peter Haase, Pascal Hitzler. Learning Expressive Ontologies [C]. Proceeding of the 2008 conference on Ontology Learning and Population: Bridging the Gap between Text and Knowledge,2008:45-69.
    [127]David Manzano-Macho, Asuncion Gomez-Perez, Daniel Borrajo. Unsupervised and Domain Independent Ontology Learning:Combining Heterogeneous Sources of Evidence [C]. Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), Marrakech, Morocco, May 28-30,2008.
    [128]Jon Atle Gulla, Terje Brasethvik. A Hybrid Approach to Ontology Relationship Learning [C]. Proceedings of the 13th International Conference on Natural Language and Information Systems:Applications of Natural Language to Information Systems, London, UK.2008:79-90.
    [129]Ricardo Gacitua, Pete Sawyer, Paul Rayson. A Flexible Framework to Experiment with Ontology Learning Techniques [J]. Knowledge-Based Systems,2008.4,21(3): 192-199.
    [130]Albert Weichselbraun, Gerhard Wohlgenannt, Arno Scharl, et al. Discovery and Evaluation of Non-Taxonomic Relations in Domain Ontologies [J], International Journal of Metadata Semantics and Ontologies,2009,4(3):212-222.
    [131]Zellig Harris. Mathematical Structures of Language [M]. New York: Wiley-Interscience,1968.
    [132]Giovanni Modica, Avigdor Gal, Hasan M. Jamil. The Use of Machine-Generated Ontologies in Dynamic Information Seeking [C]. Proceedings of the 9th International Conference on Cooperative Information Systems. Heidelberg:Springer Berlin,2001: 433-448.
    [133]Missikoff M., Navigli R., Velardi P. Integrated approach for web ontology learning and engineering [J]. IEEE Computer, 2002,35(11):54-57.
    [134]Volz R, Oberle D, Staab S, Studer R. OntoLiFT Prototype [EB/OL].IST Project 2001-33052 WonderWeb Deliverable 11,2003.
    [135]Mehrnoush Shamsfard, Ahmad Abdollahzadeh Barforoush. Learning Ontologies from Natural Language Texts [J]. International Journal of Human-Computer Studies,2004, 60(1):17-63.
    [136]Philipp Cimiano, Johanna Volker, Rudi Studer. Ontologies on Demand? A Description of the State-of-the-Art, Applications, Challenges and Trends for Ontology Learning from Text [J]. Information, Wissenschaft und Praxis,2006.10, 57(6-7):315-320.
    [137]杜文华.本体的构建及其在数字图书馆中的应用研究[D].武汉大学,2005.4.
    [138]张春霞.领域文本知识获取方法研究及其在考古领域中的应用.中国科学院计算技术研究所,2005.6.
    [139]刘柏崇.基于Web的通用本体学习研究[D].浙江大学,2007.1.
    [140]何琳.古农学本体的半自动构建及检索研究[D].南京农业大学,2007.6.
    [141]傅魁.基于Web的本体学习研究[D].武汉理工大学,2007.10.
    [142]李景.领域本体的构建方法与应用研究[D].中国农业科学院农业信息研究所,2009.2.
    [143]Yu Juan, Dang Yanzhong. Learning Domain Feature from Text Corpora [C]. In Management Track within WiCOM:Engineering, Services and Knowledge Management (EMS'2008), Dalian, China, October 12-17,2008.
    [144]Roberto Navigli, Paola Velardi. Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites [J]. Computational Linguistics. MIT press,2004, 30(2):151-179.
    [145]G. Salton, C.Buckley. Term-weighting Approaches in Automatic Text Retrieval [J]. Information Processing & Management,1988,24(5):513-523.
    [146]HIT IR Lab. http://ir. hit. edu. cn/demo/ltp/Sharing_Plan.htm. (Accessed on Jan 1,2007).
    [147]Wikipedia. Principle of Compositionality. http://en.wikipedia.org/wiki/Com positionality, (Accessed on Nov 27,2008).
    [148]Klaas Dellschaft, Steffen Staab. On How to Perform a Gold Standard based Evaluation of Ontology Learning [C]. Proceedings of International Semantic Web Conference (ISWC-2006), Athens, GA, USA:Springer, LNCS,2006.9:228-241.
    [149]Peter Spyns, Marie-Laure Reinberger. Lexically Evaluating Ontology Triples Generated Automatically from Texts [C]. Proceedings of the second European Conference on the Semantic Web (ESWC 2005), Heraklion, Crete, Greece,2005.5: 563-577.
    [150]Marta Sabou, Chris Wroe, Carole Goble, et al. Learning Domain Ontologies for Semantic Web Service Descriptions [J]. Journal of Web Semantics,2005,3(4): 340-365.
    [151]全国科学技术名词审定委员会.http://www. cnctst. gov. cn/.
    [152]国家科委,中国科学院,国家教委,新闻出版署.关于使用全国自然科学名词审定委员会公布的科技名词的通知.1990.6.23.
    [153]中华人民共和国国务院.国函(1987)142号.1987.8.12.
    [154]国家自然科学基金委员会.http://www. nsfc. gov. cn/nsfc2009/index. htm.
    [155]于娟,党延忠.结合词性分析与串频统计的词语提取方法[J].系统工程理论与实践,2010,30(1):105-111.
    [156]于娟,党延忠.领域特征词的提取方法研究[J].情报学报,2009,28(3):368-373.
    [157]高素婷.科技名词审定工作实践与体会[J].中国科技术语,2009,11(01):11-15.
    [158]祁国荣.群策群力,把名词审定与释义工作做好[J].科技术语研究,2005,7(3):27-28.
    [159]李启斌.对当前名词审定工作的几点建议[J].科技术语研究,2000,2(3):30-31.
    [160]金岳霖.形式逻辑[M].北京:人民出版社,1979.10:54-57.
    [161]于娟,王贱珍,马金平,李永.基于学科体系的OWL知识表示[J].现代图书情报技术,2006,136(5):18-21.
    [162]于娟,王贱珍,马金平,李永.基于课程体系的OWL知识表示方法研究[J].现代图书情报技术,2006,134(3):5154.
    [163]于娟,马金平,李永.基于Web本体语言OWL的知识表示[J].计算机工程与设计,2006.27(22):4356-4357.
    [164]Graham Klyne, Jeremy J. Carroll. Resource Description Framework (RDF):Con cepts and Abstract Syntax [EB/OL]. http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/, (Accessed on Jul.14,2004).
    [165]Brickley D, Guha R. Resource Description Framework (RDF) Schema Specification [EB/OL]. http://www.w3.org/TR/2000/CR-rdf-schema-20000327, (Accessed on Jul.14, 2004).-103-

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700