Ontology在农业信息管理中的构建和转化
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
Ontology(本体论)最早是哲学上的一个概念,后被应用于计算机人工智能研究领域。20世纪90年代起,研究者将其应用于信息系统研究领域,主要概念是指某一专业领域的全部术语、对术语的定义以及术语间的关系构成的知识系统。
     近年来,图书情报界和其它许多领域都对本体论表现出浓厚的兴趣,从不同角度研究了它在知识组织、情报检索等领域的应用潜力。为了探索本体论在农业信息管理中的理论和应用价值,研究叙词表在网络信息时代的发展方向,本论文研究了领域本体论的构建、以及从叙词表向本体论的转化方法。
     在系统介绍和分析了本体论构建理论的基础上,使用目前在国际上性能比较完善的本体论构建和维护工具KAON,进行了本论文的研究工作。论文把本体论的理论方法与传统的知识组织工具结合起来,利用已有的知识和技术,具体实现了本体论原型的构建。研究的核心是以食物安全本体论的构建作为分析案例,具体构建方法可以划分为4个步骤:
     第一步是通过领域专家的参与,挑选典型的5篇食物安全论文,选择其中的重要词汇,人工构建了一个含有58个术语的食物安全核心本体论。
     第二步选择中国农业科学叙词表中作物分类大类(989个词汇),通过JAVA程序,将叙词表数据库转化为KAON格式文件,生成KAON可读叙词表本体论,再人工选择一些叙词表中与食物安全有直接关系的叙词,按语义关系加入到本体论中,完成从叙词表向本体论的转化。
     第三步是将前面的两个本体论合并,借助现有网络搜索引擎,在因特网上人工挑选60条与食物安全相关的论文、会议等各类信息,组成一个16万字的食物安全语料库,人工选择重要词汇,统计词频,将合适的词汇也加入到合并的本体论中,则基本建成了食物安全本体论。
     第四步即进行定期的维护和更新。通过食物安全一个专业领域的研究,总结具体构建的过程和方法,将来即可依据类似方法,建立农业其它领域的本体论,甚至整个农业领域的本体论。
     论文其它部分则围绕本体论相关的主题,展开相应的调查和分析,总结出本体论在国内研究属于起步阶段,从多语种叙词表转化本体论是一个可行的环节,并展望了本体论在农业信息领域的应用前景等。
     本研究是在国内图书情报领域最早研究本体论的论文之一,首次将部分中国农业科学叙词表转化为RDFS格式本体论,这是论文的重要创新成果之一。论文构建了食物安全本体论原型,无论是构建方法方面,还是从叙词表进行转化,研究成果都具有创新性,对国内开展本体论理论和实践研究,具有参考价值;在我国农业信息管理领域,对实现智能化信息处理和信息检索,具有重要意义。
The term "ontology," derived from philosophy, is the study of being and existence. More recently, within the context of computer science, it has come to mean a conceptual schema consisting of rigorously defined concepts, attributes, relations, and rules defining a given domain.
    In recent years, many specialists in libriary science and information science, have great interest in the research of ontology, they research its potential applications in knowledge organization and information retrieval from various angles. In order to search after the theory and application value in agricultural information management, and to research the trend of thesaurus in the web age, this dissertation research the construction way of ontology, and the conversion way from thesaurus.
    Based on the introduction of ontology theory, the author uses KAON, the construction and maintainance tool of ontology, designed and completed the first Chinese food safety ontology prototype.
    This dissertation presents the methods used to build a food safety ontology prototype in Chinese. First, it describes the use of the Kaon Suite of tools to create the ontology. Second, it describes the selection of typical food safety papers from the web, from which 58 terms within the food safety domain were extracted to create the food safety core ontology. Third, about a thousand terms from the Chinese agricultural Thesaurus were selected and then incorporated programmatically into the food safety ontology using Java; additional food safety concepts from the Thesaurus were manually added. Fourth, the core and thesaurus ontology were combined. A corpus of 60 documents on food safety was collected, and merged into a single file. The file contained 160 thousand Chinese characters. The associated terms were sorted by frequency, and the high-frequency terms added to the ontology. The final work involved revision and maintenance. Using this methodology, a Chinese Food Safety Ontology prototype was created. Howe
    ver, this procedure can be used to create other domain ontologies. The author also discusses the conversion of a thesaurus to an ontology, the uses of ontology, and so on.
    This dissertation is one of the earlist ontology research in library and information science in China, the author first transfer a part of the Chinese Agricultural Thesaurus to RDFS ontology in China. Both the way of ontology construction and the transfer from thesaurus are innovative. It is useful to the research of theory and practice in China, and is helpful to develop intellectual information retrieval engine.
引文
[1] 百度. http://www.baidu.com(Accessed March 23, 2004)
    [2] 白清云.农产品中环境污染物的控制标准与食物安全体系.农业环境保护,2000,(4):230-233
    [3] 曹军.Google的PageRank技术剖析.情报杂志,2002,(10):15-18
    [4] 曹树金,罗春荣.信息组织的分类法与主题法.北京图书馆出版社,2000:438.
    [5] 常春.第四媒体:信息技术与企业机遇.安徽科技与企业,2000,(1):5.
    [6] 常春,卢文林.叙词表编制历史、现状与发展.农业图书情报学刊,2002,(5):25-28.
    [7] 常春.联合国粮食与农业组织AOS项目.农业图书情报学刊,2003a,(2):14-15,24
    [8] 常春.Ontology在农业情报检索领域的研究背景.现代图书情报技术,2003b,(6):4-7
    [9] 常春.网络信息检索存在的问题及发展方向.情报杂志,2003c,(11):93-94
    [10] 邓志鸿,唐世渭,张铭,等.Ontology研究综述.北京大学学报(自然科学版),2002,38(5):730-738.
    [11] 丁声俊.居安思危确保国家食物安全.求是,2003,(21):33-34
    [12] 冯传岗.BCTV亟待进入Internet领域.中国有线电视,2002,(7):63-67
    [13] 国际标准化组织,ISO 5964文献工作——多语种叙词表的编制与修订准则,第1版,1985-02-15.
    [14] 郭祥文,刘惟一,钱民,等.基于本体论的信息检索.云南大学学报(自然科学版),2003,25(4):324-327
    [15] 简清国,林茂竹.ontology.英汉多功能词典.北京:外语教学与研究出版社,1997:1037
    [16] 李冠强.数字图书馆研究.北京图书馆出版社,2002
    [17] 李国辉,汤大权,武德峰.信息组织与检索.科学出版社,2003,283-307
    [18] 李艳梅.浅议搜索引擎Google.现代情报,2003,(7):59-61
    [19] 李朝葵,陶卫国.语义检索.情报科学,2002,20(11):1190-1192.
    [20] 梁树柏.检索语言创新的实践论——以分类口岸为例.图书情报工作,2003,(10):
    [21] 廖明光,王华敏,廖明宏.数字图书馆的智能检索技术.电脑学习,2001,(3):2-3.
    [22] 廖明宏,程光明,吴翔虎.一个WWW智能搜索引擎.计算机应用研究,2001,(5):29-31.
    [23] 廖明宏.本体论与信息检索.计算机工程,2000,26(2):56-58.
    [24] 凌美秀.关于搜索引擎当前存在的主要问题及其发展趋势的探讨.高校图书馆工作,2002,(21):29-33
    [25] 陆履亨.网络信息资源管理概论.上海人民出版社,2003
    [26] 刘柏蒿.一种面向语义Web的数字图书馆框架.大学图书馆学报,2003,(1):13-16,32
    [27] 刘炜,张亮.数字图书馆的体系结构与元数据方案.情报学报,2003,22(5):148-154
    [28] 刘娇蛟,龚丽,李建华.基于本体实现对网页文本的自动主题分类.计算机工程,2003,29(11):95-97
    
    
    [29] 刘湘生.主题法的理论与标引.书目文献出版社,1985:21-28.
    [30] 卢良恕.新时期我国农业结构战略性调整与食物安全.中国食物与营养,2002,(4):4-7
    [31] 马张华,侯汉清.文献分类法主题法导论.北京图书馆出版社,1999:165.
    [32] 梅方权.关于食物安全保障概念的发展分析.中国食物与营养,2002,(5):4-6
    [33] 农业部情报研究所.农业科学叙词表.中国农业出版社,1994:1078
    [34] RDF——Web数据集成的元数据解决方案.http://www.jiafangyifang.com/news/newsdata/2001_12_04/4/20011204161934.htm(Accessed April 3,2004)
    [35] 宋翠云.中外叙词表的发展.河南图书馆学刊.1995,(4):27-28.
    [36] 田梅,王兰成,侯双.主题词表及其语义环境的计算机构建与应用研究.情报学报,2003,22(5):572-576
    [37] 王林.网络环境中信息检索的特点及发展趋势.图书馆学研究,2002,(2):91-92.
    [38] 王念滨.基于本体论的知识集成方法及系统实现技术.哈尔滨工业大学博士论文,2001.
    [39] 王泰森.一个基于本体论全文自动标引方案.情报科学,2003,21(9):950-952
    [40] 武成岗,焦文品,田启家,等.基于本体论和多主体的信息检索服务器.计算机研究与发展,2001,38(6):641-647.
    [41] 吴国文,顾宁,施伯乐.利用本体簇生成标准概念模式.计算机研究与发展,2001,(12):1499-1504
    [42] 吴潇.多媒体信息检索研究.情报杂志,2002,21(10):19-21
    [43] 新英汉词典编写组.ontology.新英汉词典.上海译文出版社,1978:907
    [44] 邢平平,施鹏飞,赵奕,基于本体论的数据挖掘方法.计算机工程,2001,27(5):15-16,59.
    [45] 许涛,吴淑燕.Google搜索引擎及其技术简介.现代图书情报技术,2003,(4):58-61
    [46] 杨秋芬,陈跃新.Ontology方法学综述.计算机应用研究,2002,(4):5-7.
    [47] 杨晓青,陈家训.一种利用RDF(S)建立本体论的方法.计算机应用研究,2002,(4):54-57
    [48] 杨晓青,陈家训.语义Web.计算机应用研究,2002,(6):60-63,67.
    [49] 叶鹰.信息科技的形式本体论研究.情报学报,2003,22(5):561-564
    [50] 曾蕾.联机环境中的情报检索语言.书目文献出版社,1996:214-233.
    [51] 张立富,等.中国的粮食安全问题.农业经济,2002,(3):39-40
    [52] 张琪玉,丘峰,翟凤岐.情报检索语言论文选.书目文献出版社,1990:352.
    [53] 张晓林.Semantic Web与基于语义的网络信息检索.情报学报,2002,21(4):413-420
    [54] 张燕,惠佳颖.网络搜索引擎评价.现代图书情报技术,2001,(4):34-36,58
    [55] 中国大百科全书总编辑委员会《哲学》编辑委员会.中国大百科全书哲学卷.北京:中国大百科全书出版社,1987:35
    [56] 中国翻译网 http://www.chinatranslate.net/IT/it4004.htm(Accessed April 3, 2004)
    [57] 中国互联网络发展状况统计调查.http://www.cnnic.net.cn/index/0E/00/11/index.htm(Accessed March 27, 2004)
    [58] 中国机器翻译的世纪回顾.http://www.keenage.com/html/c_index.html(Accessed April 3, 2004)
    
    
    [59] “中国图书分类法”回顾与前瞻座谈会辑要.http://www.ncl.edu.tw/pub/c_news/90/05.html(Accessed May 16,2004)
    [60] 中国图书馆分类法编委会.中国图书馆分类法(第4版).北京:北京图书馆出版社,1999.
    [61] 中华人民共和国国家标准,GB13190—91,汉语叙词表编制规则.中国标准出版社,1991
    [62] 中华人民共和国国家标准,GB/T15417—94,文献 多语种叙词表编制规则.中国标准出版社,1995:1
    [63] 中文图书标题表.http://datas.ncl.edu.tw/catweb/2-1-4.htm(Accessed May 16,2004)
    [64] 朱俊卿.搜索引擎Google研究.现代图书情报技术,2002,(1):45-47
    [65] About the AOS. http://www.fao.org/agris/aos (Accessed August 11, 2003)
    [66] About the DAML Language. http://www.daml.org/about.html (Accessed March 8, 2004)
    [67] Pease A., Niles I., Li J. The Suggested Upper Merged Ontology: A Large Ontology for the Semantic Web and its Applications, in Working Notes of the AAAI-2002 Workshop on Ontologies and the Semantic Web. http://home.earthlink.net/~adampease/professional/(Accessed April 4, 2004)
    [68] AGROVOC, Multilingual Agricultural Thesaurus. Food and Agricultural Organization of the United Nations, 4th English version, Rome, 1999.
    [69] AI in design webliography, http://www.cs.wpi.edu/Research/aidg/AIinD-hotlist.html(Accessed March 7, 2004)
    [70] AI on the Web. http://www.cs.berkeley.edu/~russell/ai.html(Accessed March 7, 2004)
    [71] Algernon - Rule-Based Programming. http://algernon-j.sourceforge.net/(Accessed March 7, 2004)
    [72] Andrew Large,Jamshid Beheshti.OPACS :a research review.LISR, 1997,19:111-133
    [73] Boris Lauser, Tanja Wildemann,Allison Poulos,et al. A comprehensive framework for building muldlingual domain ontologies: creating a prototype biosecurity ontology. AFITA2002, Asian Agricultural Information Technology & Management. Proceedings of the Third Asian Conference for Information Technology in Agriculture. Edited by Mei Fangquan. China agricultural scientech press,2002:31-41
    [74] Chimaera. http://www.ksl.stanford.edu/software/chimaera/(Accessed March 7, 2004)
    [75] CODE4: Conceptually Oriented Description Environment. http://www.csi.uottawa.ca/~doug/CODE4.html(Accessed March 7, 2004)
    [76] DARPA/Rome Laboratory Planning Initiative. http://arpi.isx.com/(Accessed March 7, 2004)
    [77] Dagobert Soergel, et al. Reengineering Thesauri for New Applications: the AGROVOC Example. http://jodi.ecs.soton.ac.uk/Articles/v04/i04/Soergel(Accessed June 8, 2004)
    [78] Extensible Markup Language (XML)1.0. http://www.w3.org/TR/2004/REC-xml-20040204/(Accessed April 3, 2004)
    [79] Food and Agriculture Organization of the United Nations. AGROVOC: Multilingual Agricultural
    
    
    [80] Thesaurus.4th Edition English Version. Rome, 1999.
    [81] Gaines, B. Editorial: Using Explicit Ontologies in Knowledge-based System Development. International Journal of Human-Computer Systems, 1997,46:181.
    [82] Google~(TM). http://www.google.com (Accessed March 27, 2004)
    [83] Gruber T R. A translation approach to portable ontology specifications. Knowledge Acquisition, 1993,5(2): 199-220
    [84] Gruber T R."Toward principles for the design of ontologies used for knowledge sharing." 1993b. http://ksl-web.stanford.edu/KSL_Abstracts/KSL-93-04.html(Accessed June 10, 2004)
    [85] Gruber T R.Toward principles for the design of ontologies used for knowledge sharing.Int. Journal of Human and computer studies, 1995:907-928.
    [86] Guarino N. Formal ontology and information systems. In: N. Guarino, editor, Proceedings of the 1st International Conference on Formal Ontologies in Information Systems, FOIS'98, Trento, Italy, IOS Press, 1998:3-15
    [87] Index of/software, http://www.ksl.stanford.edu/software/(Accessed March 7, 2004)
    [88] Infrastructure and Applications for the Semantic Web. International Semantic Web Working Symposium (SWWS). http://www.semanticweb.org/SWWS/(Accessed December 15, 2003).
    [89] International Semantic Web Conference (ISWC). http://iswc.semanticweb.org/(Accessed December 15, 2003)
    [90] John Bateman's ontology portal. http://www.fb10.uni-bremen.de/anggistik/langpro/webspace/jb/info-pages/ontology/ontology-root.htm (Accessed March 7, 2004)
    [91] Knight K, Luk S.Building a Large Knowledge Base for Machine Translation. In Proceedings of American Association of Artificial Intelligence Conference (AAAI-94).Seattle, WA
    [92] Mahesh, K. Ontology Development for Machine Translation: Ideology and ethodology. New Mexico State University, 1996, Computing Research Laboratory MCCS-96-292.
    [93] Markup Languages and Ontologies. http://www.semanticweb.org/knowmarkup.html(Accessed March 8, 2004)
    [94] Neches R,Fikes R E,Gruber T R,et al. Enabling technology for knowledge sharing. AI Magazine, 1991, 12(3):36-56.
    [95] Papers Selected.http://www.keenage.com/html/c_index.html(Accessed April 3, 2004)
    [96] Philip Babcock Gove and the Merriam-webster Editorial Staff. Webster's Third New International Dictionary. G.& C.Merriam Company. 1976:1577
    [97] Qin Jian, Paling Stephen. Converting a controlled vocabulary into an ontology: the case of GEM. Information Research, 2001,6(2) Available at: http://InformationR.net/ir/6-2/paper94.html
    [98] Sowa J F. Knowledge Representation:Logical, Philosophical,and Computational Foundations. PWS Publishing Co.(forthcoming),Boston
    
    
    [99] Some Ongoing KBS/Ontology Projects and Groups. http://www.cs.utexas.edu/users/mfkb/related.html#kaon (Accessed March 7, 2004).
    [100] TouchGraph LLC. http://www.touchgraph.com/(Accessed March 10, 2004).
    [101] The Library of Congress. http://www.loc.gov(Accessed March 22, 2004)
    [102] The Onto Web and the DARPA Program. 1st International Semantic Web Conference (ISWC2002). http://iswc2002.semanticweb.org/(Accessed November 17, 2003).
    [103] Vannevar Bush.As we may think. http://www.theatlantic.com/unbound/flashbks/computer/bushf.htm
    [104] Web-Ontology (WebOnt) Working Group. http://www.w3.org/2001/sw/WebOnt/(Accessed April 4, 2004)
    [105] Welcome to KAON. http://kaon.semanticweb.org(Accessed March 12, 2004)
    [106] Welcome to the OIL. http://www.ontoknowledge.org/oil/(Accessed March 8, 2004)
    [107] Welcome to the Prot(?)g(?) Project. http://protege.stanford.edu/index.html(Accessed April 4, 2004)
    [108] What is an Ontology?http://www-ksl.stanford.edu/kst/what-is-an-ontology.html(Accessed March 8, 2004)
    [109] WonderWeb, Ontology Infrastructure for the Semantic Web. http://wonderweb.semanticweb.org/index.shtml (Accessed December 15, 2003)

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700