Abstract
To improve the efficiency of twig query processing, the core operation of an XML database query engine, this paper proposes TwigRT, a twig query processing algorithm based on semantic information. TwigRT uses the object semantics given in the XML schema definition to identify objects in the XML data, and stores their properties and values in relational database tables. A twig query is decomposed into a content query and a structural query. The content query is evaluated by executing SQL over the relational tables, and its results narrow the search scope of the structural query, which is then processed with a holistic structural matching (holistic join) algorithm. Experiments verify the effectiveness of the algorithm and show that the approach is scalable and efficient.
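The decomposition described above can be illustrated with a minimal sketch. It is not the TwigRT implementation. It assumes a hypothetical `book` object table, hypothetical region labels (`start_pos`, `end_pos`) on each element, and the example query `//book[price<60]//author`. The structural step here is a naive containment check standing in for a holistic join, chosen only to keep the example short.

```python
import sqlite3

# Hypothetical object table: one row per <book> object identified from the schema.
# start_pos/end_pos are assumed region labels of the object's root element.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE book (start_pos INTEGER, end_pos INTEGER, title TEXT, price REAL)"
)
conn.executemany(
    "INSERT INTO book VALUES (?, ?, ?, ?)",
    [
        (2, 9, "XML Basics", 30.0),
        (10, 19, "Twig Joins", 55.0),
        (20, 27, "SQL Tuning", 80.0),
    ],
)

# Hypothetical labelled structural nodes: (tag, start_pos, end_pos).
author_nodes = [
    ("author", 5, 6),
    ("author", 13, 14),
    ("author", 15, 16),
    ("author", 23, 24),
]


def content_query(conn, max_price):
    """Content part: the value predicate is answered by SQL over the object table."""
    rows = conn.execute(
        "SELECT start_pos, end_pos, title FROM book "
        "WHERE price < ? ORDER BY start_pos",
        (max_price,),
    )
    return rows.fetchall()


def structure_query(candidates, nodes):
    """Structural part: keep nodes whose region lies inside a candidate object.

    A naive nested loop, used here in place of a holistic join algorithm.
    """
    matches = []
    for start_pos, end_pos, title in candidates:
        for tag, n_start, n_end in nodes:
            if start_pos < n_start and n_end < end_pos:
                matches.append((title, tag, n_start))
    return matches


# //book[price<60]//author: SQL first, then structure matching on the survivors.
candidates = content_query(conn, 60.0)
result = structure_query(candidates, author_nodes)
```

In this sketch the SQL step discards the third book before any structural matching happens, so the `author` node at position 23 is never compared against a surviving candidate region. This mirrors, in simplified form, how the content query reduces the structural search scope.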