Efficient Processing of Distributed Twig Queries Based on Node Distribution
  • Authors: Xin Bi; Xiang-Guo Zhao; Guo-Ren Wang
  • Keywords: XML; twig query; distributed computing; node distribution
  • Journal: Journal of Computer Science and Technology
  • Publication year: 2017
  • Publication date: January 2017
  • Volume: 32
  • Issue: 1
  • Pages: 78-92
  • Journal category: Computer Science
  • Journal subjects: Computer Science, general; Software Engineering; Theory of Computation; Data Structures, Cryptology and Information Theory; Artificial Intelligence (incl. Robotics); Information Systems Applications (
  • Publisher: Springer US
  • ISSN: 1860-4749
Abstract
Massive XML data are increasingly generated for the representation, storage, and exchange of web information, and twig query processing over such data has become a research focus. However, most traditional algorithms cannot be directly implemented in a distributed manner. Some existing distributed algorithms generate many useless intermediate results and, in most cases, execute many join operations over partial results; others require a priori knowledge of the query pattern before XML partitioning, storage, and query processing, which is impractical for large-scale data or frequently arriving new queries. To improve efficiency and scalability, we propose DisT3, a 3-phase distributed algorithm based on a node distribution mechanism that avoids unnecessary intermediate results. Furthermore, we propose ReP, a lightweight local index with an enhanced XML partitioning approach using an arbitrary partitioning strategy, and, based on ReP, an improved 2-phase distributed algorithm, DisT2ReP, that further reduces the communication cost. After analyzing the performance guarantees, we conduct extensive experiments to verify the efficiency and scalability of the proposed algorithms in distributed twig query applications.