Ontology-Based Intelligent Processing and Application of Web Government Resources
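Abstract
The Internet keeps growing, but its resources lack a unified semantic description. Computers therefore cannot effectively understand the meaning of information on the Web, and users find it increasingly hard to have computers locate and process the resources they need in massive volumes of data. The same problem exists in the processing of government Web information resources.
     The thesis therefore first analyzes this problem as it appears in current e-government information resource processing: government information resources are poorly shared, interoperability is weak, keyword search performs badly, and information processing lacks intelligent and user-friendly support. The ontology-based Semantic Web technology promoted by the W3C offers a way to address these problems. By capturing the semantics of vocabulary, the Semantic Web lets government Web applications not only represent information but also understand it, and then reason over it according to given rules.
     The thesis then introduces the theory and technical concepts of the Semantic Web, including its definition, research status, architecture, and key technologies, as well as the definition, types, description languages, construction methods, and editing tools of ontologies.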
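     Having reviewed the underlying technologies of the Semantic Web and ontologies, the thesis proposes an acquisition model for a government information resource ontology. Guided by the cyclic acquisition methodology of ontology construction, the model converts the 《电子政务主题词表》 (E-Government Thesaurus) into an initial e-government domain ontology. Subject to review by domain experts or system administrators, the domain ontology then keeps learning new concept instances during later semantic information extraction and is steadily refined, so that the two construction methods (thesaurus conversion and incremental learning) complement each other.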
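The acquisition loop described in this paragraph can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the thesis's implementation: the names `ThesaurusTerm`, `Ontology`, `propose_instance`, `review`, and `thesaurus_to_ontology` are hypothetical, the thesaurus is assumed to expose only broader-term links, and the expert review is modelled as a plain callback.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ThesaurusTerm:
    """One thesaurus entry (hypothetical shape); `broader` holds broader-term links."""
    name: str
    broader: list[str] = field(default_factory=list)


@dataclass
class Ontology:
    """Toy domain ontology: a class hierarchy plus reviewed instances."""
    subclass_of: dict[str, set[str]] = field(default_factory=dict)  # class -> parent classes
    instances: dict[str, set[str]] = field(default_factory=dict)    # class -> accepted instances
    pending: list[tuple[str, str]] = field(default_factory=list)    # (instance, class) awaiting review

    def propose_instance(self, instance: str, cls: str) -> None:
        """Queue a candidate found during extraction; nothing is added yet."""
        if cls in self.subclass_of:
            self.pending.append((instance, cls))

    def review(self, approve) -> None:
        """Add each queued candidate only if the reviewer callback accepts it."""
        for instance, cls in self.pending:
            if approve(instance, cls):
                self.instances.setdefault(cls, set()).add(instance)
        self.pending.clear()


def thesaurus_to_ontology(terms: list[ThesaurusTerm]) -> Ontology:
    """Turn each term into a class and each broader-term link into a subclass link."""
    onto = Ontology()
    for term in terms:
        onto.subclass_of.setdefault(term.name, set()).update(term.broader)
        for parent in term.broader:
            onto.subclass_of.setdefault(parent, set())
    return onto
```

In this sketch, calling `thesaurus_to_ontology` once yields the initial ontology, and later extraction runs call `propose_instance` followed by `review`, so nothing enters the ontology without approval.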
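     Next, considering the characteristics of Chinese natural language processing and building on the complementary ontology construction methods above, the thesis proposes a model for extracting and annotating semantic information from Web government resources. After analyzing the concepts and reference resources the model relies on, it discusses in detail the model's two key problems: semantic analysis and extraction for Chinese text, and semantic annotation of Web pages.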
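As a rough illustration of how segmentation and annotation fit together, the Python sketch below uses forward maximum matching against an ontology lexicon. This is a simple dictionary-based stand-in chosen for brevity; the abstract does not say which segmentation method the thesis uses. The function name `segment_and_annotate` and the sample lexicon are hypothetical.

```python
from typing import Dict, List, Optional, Tuple


def segment_and_annotate(
    text: str, lexicon: Dict[str, str]
) -> List[Tuple[str, Optional[str]]]:
    """Split `text` by forward maximum matching and tag each piece.

    `lexicon` maps a surface term to an ontology concept name (assumed shape).
    Pieces not found in the lexicon fall back to single characters tagged None.
    """
    max_len = max((len(term) for term in lexicon), default=1)
    result: List[Tuple[str, Optional[str]]] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in lexicon or size == 1:
                result.append((piece, lexicon.get(piece)))
                i += size
                break
    return result


# Hypothetical lexicon: surface terms mapped to ontology concepts.
lexicon = {"市长": "Mayor", "市长信箱": "MayorMailbox", "税收": "Taxation"}
annotated = segment_and_annotate("市长信箱税收问题", lexicon)
```

Longest-match-first means "市长信箱" is annotated as one concept rather than as "市长" plus two unknown characters. A real system would still need disambiguation and handling of words outside the lexicon.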
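     Finally, based on the above technical analysis, the thesis gives a preliminary model-level discussion of two ontology-based intelligent processing problems for Web government resources: the architecture of a mayor's mailbox classification and handling system, and the architecture of a semantic government portal.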
The Web is growing rapidly, but its information lacks a uniform semantic description, so computers have difficulty understanding information on the Web, and it is increasingly difficult to find, organize, access, and maintain the information users require. The same holds for e-government information resource processing.
    The author first analyzes the current problems in processing government information resources, such as a low degree of sharing, poor keyword search results, and a lack of intelligent and user-friendly support. The ontology-based Semantic Web technology proposed by the W3C points to a way of solving these problems. By capturing the semantics of words, e-government Web applications can represent and understand information, and can also reason over it according to given rules.
    The paper then introduces the related theory and technology of the Semantic Web, including the concept, research status, architecture, and key technologies of the Semantic Web, and the concept, types, description languages, construction methods, and editing tools of ontologies.
    The paper studies the problems involved in ontology acquisition, including transforming the e-government subject thesaurus into a domain ontology. Under review by e-government domain experts or ontology administrators, the domain ontology can continuously learn new ontology instances.
    Based on the characteristics of Chinese information processing, the paper puts forward a model for the automatic extraction and annotation of e-government semantic information. The model is based on the Semantic Web, and the paper discusses two of its key issues in detail: automatic segmentation and extraction of Chinese words, and semantic annotation.
    Finally, the paper studies two models for ontology-based e-government information processing: a model of an automatic classification and handling system for the mayor's mailbox, and a model of an ontology-based semantic government portal that makes it easy for users to reach the information they want.
