摘要
为解决现有原生XML数据库查询效率不高、查询并发度低、数据库规模小及可扩展性不足等问题,基于蚁群高效多用计算平台设计一个大规模分布式原生XML数据库原型系统。采用软硬协同设计思想,在硬件部分设计多并行、全流水、可配置的查询过滤结构,以实现高效查询。利用软件平台的通用性,实现数据库的灵活配置和查询结果排序、筛选等复杂处理,并针对现有原生XML,数据库存在的问题给出相应的解决方法。实验结果表明,该系统可行有效,查询效率相对于Tamino数据库有30倍左右的性能提升,并且能够支持大量并发操作,具有良好的扩展性。
To solve the problems of inefficient queries,low concurrency,small database capacity and bad scalability of the existing NativeXML database,a large-scale distributed NativeXML database prototype is designed based on efficient and multi-purpose computing platform.It adopts hardware-software co-design and designs a query filtering structure with high concurrency,fully pipelining and configurability in the hardware to process queries efficiently,while using the software to flexibly configure the database and do other complex processing like sorting and filtering.This paper also gives the solutions for the problems of existing NativeXML database.Experimental results show that the proposed prototype is feasible and effective.It has about 30 times speedup over Tamino database in query efficiency,supports high concurrency and has good scalability.
引文
[1]谷瑜青.XML数据库及其应用研究[J].电脑编程技巧与维护,2015(10):80-81.
[2]Meier W.eXist:An Open Source Native XML Database[M]//Chaudhri A B.Jeckle M.Rahm E,et al.Web,Web-Services,and Database Systems.Berlin,Germany:Springer,2002:169-183.
[3]Yuan Xiaojie.Hu Xiangyu,Wu Dongxing,et al.XML Data Storage and Query Optimization in Relational Ratabase by XPath Processing Model[J].Journal of Software,2013,8(4):809-816.
[4]Lee K K Y,Tang W C,Choi K S.Alternatives to Relational Database;Comparison of NoSQL and XML Approaches for Clinical Data Storage[J].Computer Methods and Programs in Biomedicine,2013,110(1):99-109.
[5]刘津伸.纯XML数据库中基于索引的并行查询处理技术的研究[D].沈阳:东北大学,2011.
[6]罗玉.基于XML数据库查询优化技术的研究[D].成都:西南交通大学,2014.
[7]Jatana N.Puri S.Ahuja M,et al.A Survey and Comparison of Relational and Non-relational Database[J].International Journal of Engineering Research and Technology,2012,1(6):l-5.
[8]谢向辉,钱磊,吴东,等.蚁群:一种新型的高效多用计算平台[J].计算机研究与发展,2015,52(6):1341-1350.
[9]赵鹏飞,李乃乾.基于XML的分布式数据库集成系统[J].计算机工程,2010,36(13):70-72.
[10]王晟中,陈伟男,彭澄廉.可重构计算硬件平台的改进设计[J].计算机工程,2010,36(5):250-252.
[11]唐燕,田英爱,李宁,等.基于XML的流式文档访问方式分析[J].计算机工程与设计,2014,35(4):1458-1464.
[12]Teubner J,Woods L,Nie Chongling.Skeleton Automata for FPGAs;Reconfiguring Without Reconstructing[C]//Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data.New York,USA:ACM Press,2012:229-240.
[13]冯进,丁博,史殿习,等.XML解析技术研究[J].计算机工程与科学,2009,31(2):120-124.
[14]Noaman A Y,AI Mansour A A.A Comparative Study Between Two Types of Database Management Systems:XML-enabled Relational and Native XML[J].World Applied Sciences Journal,2012,19(7):972-985.
[15]皮涛,罗春风.基于Oracle XML DB技术的XML数据存储研究[J].软件导刊,2010,9(6):174-176.
[16]SchmidtA,Waas F,Kersten M,et al.XMark:A Benchmark for XML Data Manage ment[C]//Proceedings of the 28th International Conference on Very Large Data Bases.[S.1.]:VLDB Endowment,2002:974-985.