Order-preserving indexing
详细信息    查看全文
文摘
Kubica et al. [33] and Kim et al. [29] introduced order-preserving pattern matching: for a given text the goal is to find its factors having the same ‘shape’ as a given pattern. Known results include a linear-time algorithm for this problem (in case of polynomially-bounded alphabet) and a generalization to multiple patterns. We propose an index that enables order-preserving pattern matching queries in time proportional to pattern length. The index can be constructed in class="mathmlsrc">class="formulatext stixSupport mathImg" data-mathURL="/science?_ob=MathURL&_method=retrieve&_eid=1-s2.0-S0304397515005605&_mathId=si1.gif&_user=111111111&_pii=S0304397515005605&_rdoc=1&_issn=03043975&md5=2820969bc52e04b15a1d8efd935af960" title="Click to view the MathML source">O(nlog⁡log⁡n)class="mathContainer hidden">class="mathCode">O(nloglogn) expected time or in class="mathmlsrc">class="formulatext stixSupport mathImg" data-mathURL="/science?_ob=MathURL&_method=retrieve&_eid=1-s2.0-S0304397515005605&_mathId=si11.gif&_user=111111111&_pii=S0304397515005605&_rdoc=1&_issn=03043975&md5=f8c5bd4ae42a9021f41c00bb596f096a" title="Click to view the MathML source">O(nlog2⁡log⁡n/log⁡log⁡log⁡n)class="mathContainer hidden">class="mathCode">O(nlog2logn/logloglogn) worst-case time. It is an incomplete order-preserving suffix tree which may miss a single edge label at each branching node. For most applications such incomplete suffix trees provide the same functional power as the complete ones. We show a number of their applications, including computation of longest common factors, longest previously occurring factors and squares in a string in the order-preserving setting. We also give an class="mathmlsrc">title="View the MathML source" class="mathImg" data-mathURL="/science?_ob=MathURL&_method=retrieve&_eid=1-s2.0-S0304397515005605&_mathId=si247.gif&_user=111111111&_pii=S0304397515005605&_rdoc=1&_issn=03043975&md5=463a1d2ecf07bb369fe69371e51ef2ac">class="imgLazyJSB inlineImage" height="19" width="68" alt="View the MathML source" title="View the MathML source" src="/sd/grey_pxl.gif" data-inlimgeid="1-s2.0-S0304397515005605-si247.gif">class="mathContainer hidden">class="mathCode">O(nlogn)-time algorithm constructing complete order-preserving suffix trees.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700