Research and Implementation of a Patent Information Retrieval System Integrating Ontology and User Interests
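Abstract
Patent documents combine technical, legal, and economic information and reflect the latest state of scientific and technological research. Since the goal of strengthening independent innovation capability was put forward at the National Science and Technology Conference, patent information services, which play an important guiding role in technological innovation, have received increasing attention. Patent retrieval, as the core of patent information services, provides an effective way to obtain patent documents.
     By analyzing and comparing well-known patent information retrieval systems in China and abroad, this paper finds that the patent retrieval systems in wide use today share several shortcomings: a low degree of retrieval intelligence, an inability to capture the semantics of search conditions, and no way to reflect users' personalized needs. Since Tim Berners-Lee proposed the concept of the Semantic Web in 1998, ontology, a modeling tool that can describe the conceptual model of an information system at the semantic and knowledge levels, has been applied ever more widely. After introducing ontology-related technologies, this paper designs a patent retrieval domain ontology, together with ontology representations of the International Patent Classification table and of a user interest model, and implements a patent information retrieval system that integrates ontology and the user interest model.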
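     The main work of this paper comprises the following four points:
     (1) By analyzing the concepts commonly used in patent retrieval, it designs a patent retrieval domain ontology and an ontology representation of the International Patent Classification (IPC) table;
     (2) Based on users' needs and their historical search records, it designs an ontology representation of the user interest model, drawing on both information that users actively provide and information that the system learns passively;
     (3) To serve users at different levels and with different needs, it designs and implements three patent search modes: quick search, form search, and advanced search. It combines the ontology, the user interest model, and the search conditions entered by the user to deliver personalized retrieval that meets users' needs;
     (4) It classifies the search results by IPC number and lets users choose which categories to browse, which provides a personalized service. This avoids the situation in which an overly broad search condition sharply lowers precision and shows the user too much irrelevant information.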
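     Point (3) can be illustrated with a minimal Python sketch. It is not the system described here: the toy ontology, the `Patent` record, the scoring rule, and every function name are assumptions made only for illustration. The query is expanded with related terms from the ontology, and matching patents are then boosted by an interest weight attached to their IPC section.

```python
from dataclasses import dataclass

# Hypothetical toy ontology: each concept maps to a set of related terms.
ONTOLOGY = {
    "engine": {"motor", "turbine"},
    "battery": {"accumulator", "cell"},
}


@dataclass(frozen=True)
class Patent:
    title: str
    ipc: str  # IPC symbol, e.g. "F01D 5/18"


def expand_query(terms, ontology=ONTOLOGY):
    """Return the lowercased query terms plus related terms from the ontology."""
    expanded = set()
    for term in terms:
        term = term.lower()
        expanded.add(term)
        expanded |= ontology.get(term, set())
    return expanded


def score(patent, expanded_terms, interests):
    """Count matching title words; add the user's weight for the IPC section."""
    words = set(patent.title.lower().split())
    matches = len(words & expanded_terms)
    if matches == 0:
        return 0.0
    # Assumed interest model: a weight per IPC section (first letter of the symbol).
    return matches + interests.get(patent.ipc[:1], 0.0)


def search(patents, terms, interests):
    """Return matching patents, highest score first (ties keep input order)."""
    expanded = expand_query(terms)
    scored = [(score(p, expanded, interests), p) for p in patents]
    hits = [(s, p) for s, p in scored if s > 0]
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [p for _, p in hits]
```

     With an interest weight on section F, a query for "engine" would rank a turbine patent classified under F ahead of a motor patent classified under H, even though neither title contains the word "engine".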
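     Tests of the system confirm that applying ontology and the user interest model to patent retrieval improves the recall and precision of patent information retrieval to a certain extent. Finally, the paper summarizes the work done and lists several problems that must be solved to improve the system further.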
Patent documents combine technical, legal, and economic information and reflect the latest state of technological research. Since the goal of strengthening independent innovation capability was put forward, patent information services have drawn increasing attention because they indicate the direction of technological development. Patent retrieval, the core of patent information services, provides an effective way to obtain patent documents.
     This paper first analyzes and compares well-known patent retrieval systems in China and abroad. The patent retrieval systems in wide use today offer little retrieval intelligence: they cannot extract semantic information from search conditions and cannot reflect users' personalized needs. Tim Berners-Lee proposed the Semantic Web in 1998, and ontology, a modeling tool that describes the conceptual model of an information system at the semantic and knowledge levels, has been applied ever more widely since. Building on an introduction to ontology technology, this paper designs a patent retrieval domain ontology, uses ontology to represent the International Patent Classification table and users' interests, and implements a patent retrieval system that integrates ontology and users' interests.
     The main work of this paper comprises the following four points:
     (1) A patent retrieval domain ontology is designed by analyzing the concepts commonly used when searching for patent information, and the International Patent Classification (IPC) table is represented as an ontology;
     (2) A user interest model is designed from users' requirements and historical search records, using both information that users actively provide and information that the system learns passively;
     (3) Three patent retrieval modes are designed and implemented to meet the demands of different users: quick search, form search, and advanced search. The ontology, the user interest model, and the search conditions entered by the user are combined when users search for patent information, which yields personalized retrieval that meets users' demands;
     (4) When a user's search condition is too broad, precision drops markedly and the user receives a large amount of irrelevant information. To avoid this, the search results are classified by IPC number, and users choose which categories to browse according to their own needs.
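     The result classification in point (4) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: results are taken to be (title, IPC symbol) pairs, grouping is done at the four-character subclass level, and both function names are hypothetical. The abstract does not say which IPC level the system actually groups by.

```python
from collections import defaultdict


def ipc_subclass(ipc_code):
    """Return the four-character IPC subclass, e.g. 'G06F' from 'G06F 17/30'."""
    return ipc_code.replace(" ", "")[:4].upper()


def group_by_ipc(results):
    """Group (title, ipc_code) pairs by IPC subclass, keeping result order."""
    groups = defaultdict(list)
    for title, ipc_code in results:
        groups[ipc_subclass(ipc_code)].append(title)
    return dict(groups)
```

     A user interface could then list the group keys with their hit counts and show the titles of a group only after the user selects it.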
     Tests of the system confirm that applying ontology and the user interest model improves the recall and precision of patent information retrieval to a certain extent. Finally, this paper summarizes the work and outlines the further work needed to improve the system.
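     For reference, recall and precision have standard set-based definitions, sketched below in Python. The function name and the example document identifiers are assumptions. The abstract does not describe the test collection or the evaluation procedure that was used.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = |retrieved and relevant| / |retrieved|
    recall    = |retrieved and relevant| / |relevant|
    An empty denominator yields 0.0 for that measure.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Example: four documents retrieved, two of them among the three relevant ones.
p, r = precision_recall(["p1", "p2", "p3", "p4"], ["p1", "p3", "p5"])
```

     In this example two of the four retrieved documents are relevant, and two of the three relevant documents were retrieved.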
