Research on Semantic Similarity Computation Methods for Ontology Mapping
Abstract
The Semantic Web is an effective way to address the current Web's inability to process massive amounts of information automatically. Ontologies, as a means of conceptualizing domain knowledge, are the foundation of the Semantic Web. Because the Web is distributed, different users build ontologies suited to their own application requirements. The content these ontologies describe overlaps or is related semantically, yet the ontologies differ in representation language and representation model, which gives rise to ontology heterogeneity. Ontology mapping is an effective way to resolve this heterogeneity, and its core task is computing the similarity between concepts from different ontologies.
     The Triple Matching-Distance Model (MD3) is a typical method for computing the similarity between concepts. Working from the ontology descriptions, it computes similarity from three facets: (1) concept names (lexicon matching), (2) distinguishing features (feature matching), and (3) semantic neighborhoods (semantic-neighborhood matching). It then combines the three as a weighted sum to obtain the overall similarity between two concepts. The model nevertheless has some shortcomings.
     Building on an analysis of the MD3 model, this work adds the influence of non-hierarchical relations and of concept instances on concept similarity and proposes the MD4 model. It then proposes an ontology mapping mechanism based on MD4, constructs the corresponding mapping workflow, designs and implements an MD4-based ontology mapping algorithm, and builds an experimental platform for ontology mapping. Three groups of comparative mapping experiments show that, under the same conditions, the MD4 model achieves higher recall and precision than the MD3 model. As ontology technology continues to develop, further advantages of the MD4 model may yet be found.
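The weighted combination described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything concrete here is an assumption: the facet names, the weights, and the use of Tversky's ratio model as the set-overlap measure are illustrative choices, not the method as implemented in this work.

```python
from typing import Dict, Set


def tversky_ratio(a: Set[str], b: Set[str], alpha: float = 0.5) -> float:
    """Set-overlap similarity in [0, 1] using Tversky's ratio model.

    alpha weights the elements found only in `a`; (1 - alpha) weights
    the elements found only in `b`. The value 0.5 is an assumed default.
    """
    common = len(a & b)
    denom = common + alpha * len(a - b) + (1 - alpha) * len(b - a)
    return common / denom if denom else 0.0


def combined_similarity(scores: Dict[str, float],
                        weights: Dict[str, float]) -> float:
    """Weighted sum of per-facet similarities, normalized by total weight."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(w * scores[facet] for facet, w in weights.items()) / total


# Hypothetical feature sets for two concepts from different ontologies.
lake = {"water", "inland", "natural"}
reservoir = {"water", "inland", "artificial"}
feature_sim = tversky_ratio(lake, reservoir)

# Three MD3-style facets; an MD4-style variant would add two more entries
# (non-hierarchical relations and instances) to both dictionaries.
scores = {"name": 0.0, "features": feature_sim, "neighborhood": 0.5}
weights = {"name": 1.0, "features": 1.0, "neighborhood": 1.0}
overall = combined_similarity(scores, weights)
```

Normalizing by the total weight keeps the result in [0, 1] even when the weights do not sum to one, which makes it easy to add extra facets without rescaling the existing weights.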