Research and Implementation of Heterogeneous Database Integration Technology Based on SMSQL
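Abstract
Heterogeneous database integration builds, over multiple geographically distributed heterogeneous databases, a global database that offers the transparency users need while affecting local autonomy as little as possible. It hides the heterogeneity among the participating local databases and enables information sharing and transparent access across them. It is an active research topic in database theory and applications and is attracting growing attention from researchers.
     This thesis analyzes the basic principles and typical frameworks of heterogeneous database integration. In view of the strengths of multidatabase language systems and the shortcomings of traditional multidatabase manipulation languages, it proposes adding a schema-reuse concept of schema integration to such languages, defines a new multidatabase manipulation language, SMSQL (Schema Mapping SQL), designs and implements SMSQL middleware that parses and executes SMSQL, and applies the middleware in a concrete project.
     The main work of this thesis is as follows:
     1. It studies the basic principles and goals of heterogeneous database integration and analyzes in detail the strengths and weaknesses of existing integration techniques, laying the groundwork for the rest of the thesis.
     2. It analyzes the advantage of multidatabase language systems, namely the separation of abstraction from implementation. To address the lack of a schema integration concept and the complexity of user operations in traditional multidatabase manipulation languages, it proposes a schema-reuse concept of schema integration, defines concrete schema mapping rules, and uses field conversion to resolve schema conflicts that arise when different schemas are integrated. Finally, building on SQL-92, it designs the multidatabase manipulation language SMSQL, which includes a set of schema integration statements. This addresses the absence of a global schema and the excessive burden on users in traditional multidatabase languages.
     3. Building on the SMSQL definition, it proposes an SMSQL-based heterogeneous database integration framework, and designs and implements the middleware in that framework that parses SMSQL, covering global data dictionary design, SMSQL parsing, query processing, and query decomposition.
     4. In a grid-based vehicle identification application system for a city, the SMSQL middleware is used to operate the underlying heterogeneous databases. This reduces coupling within the system, improves development efficiency, and makes the system easier to maintain, which confirms the practical value of the middleware.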
Heterogeneous database integration has become one of the active research topics in database theory and applications. Its purpose is to build a transparent global database and to hide the heterogeneity among the geographically distributed participating (local) databases.
     The thesis analyzes the basic principles and typical frameworks of heterogeneous database integration. After identifying the advantages of multidatabase language systems and the deficiencies of traditional multidatabase languages, it proposes the concepts of global schema and schema mapping and defines a multidatabase language, SMSQL (Schema Mapping SQL). It then designs and implements the SMSQL middleware, which parses and executes SMSQL, and applies it in a specific project.
     The main contents of the thesis are:
     1. It introduces the basic concepts and objectives of heterogeneous database integration, and provides a basis for further study by analyzing the advantages and disadvantages of existing heterogeneous database integration technologies.
     2. It analyzes the advantage of multidatabase language systems, which separate abstraction from implementation, and the disadvantages of traditional multidatabase languages, which lack a global schema and are therefore hard for users to operate. It then proposes a schema-reuse concept of schema integration and defines the multidatabase language SMSQL (Schema Mapping SQL), which supports defining and modifying the global schema and resolves potential conflicts caused by the structural and semantic heterogeneity of the integrated data.
     3. It proposes a heterogeneous database integration framework based on SMSQL, and designs and implements the SMSQL middleware in that framework, covering the global data dictionary, SMSQL parsing, query processing, and query decomposition.
     4. Using the SMSQL middleware to operate the underlying heterogeneous databases in a grid-based vehicle identification application system reduces coupling within the system, improves development efficiency, and enhances maintainability. Practical application shows that the framework offers good operability, maintainability, and scalability.
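
The abstract does not give SMSQL syntax or the middleware's internals, so the following is only a minimal sketch of the general ideas it names: a global schema mapped onto differing local schemas, field conversion to resolve a unit conflict, and decomposition of a global query into one subquery per local database. It is written in Python with two in-memory SQLite databases standing in for heterogeneous sources. Every table, column, and function name (`car`, `vehicle_log`, `MAPPINGS`, `decompose`, `run_global_query`) is a hypothetical chosen for illustration and is not taken from the thesis.

```python
import sqlite3


def make_local_dbs():
    """Create two local databases with different schemas (hypothetical data)."""
    a = sqlite3.connect(":memory:")
    a.execute("CREATE TABLE car (plate_no TEXT, speed_kmh REAL)")
    a.executemany("INSERT INTO car VALUES (?, ?)",
                  [("A100", 80.0), ("A200", 120.0)])
    b = sqlite3.connect(":memory:")
    b.execute("CREATE TABLE vehicle_log (license TEXT, speed_mph REAL)")
    b.executemany("INSERT INTO vehicle_log VALUES (?, ?)",
                  [("B300", 50.0), ("B400", 100.0)])
    return {"db_a": a, "db_b": b}


# Schema mapping rules: global field -> (local column, field conversion).
# db_b stores speed in mph, so its conversion resolves the unit conflict.
MAPPINGS = {
    "db_a": {
        "table": "car",
        "fields": {
            "plate": ("plate_no", lambda v: v),
            "speed_kmh": ("speed_kmh", lambda v: v),
        },
    },
    "db_b": {
        "table": "vehicle_log",
        "fields": {
            "plate": ("license", lambda v: v),
            "speed_kmh": ("speed_mph", lambda v: round(v * 1.609344, 1)),
        },
    },
}


def decompose(global_fields):
    """Rewrite a projection over the global schema into one SQL string per source."""
    subqueries = {}
    for source, mapping in MAPPINGS.items():
        cols = ", ".join(mapping["fields"][f][0] for f in global_fields)
        subqueries[source] = f"SELECT {cols} FROM {mapping['table']}"
    return subqueries


def run_global_query(dbs, global_fields, predicate=lambda row: True):
    """Run each subquery, convert rows to the global schema, filter, and merge."""
    results = []
    for source, sql in decompose(global_fields).items():
        converters = [MAPPINGS[source]["fields"][f][1] for f in global_fields]
        for raw in dbs[source].execute(sql):
            row = {f: conv(v) for f, conv, v in zip(global_fields, converters, raw)}
            # The filter is applied after conversion, in global units.
            if predicate(row):
                results.append(row)
    return results
```

In this sketch the caller only sees the global fields `plate` and `speed_kmh`; for example, `run_global_query(make_local_dbs(), ["plate", "speed_kmh"], lambda r: r["speed_kmh"] > 100)` queries both sources without the caller naming either local table. Filtering after conversion keeps the example short; a real mediator would more likely push translated predicates down to each source.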
