Compressed Subsequence Matching and Packed Tree Coloring

Authors: Philip Bille; Patrick Hagge Cording; Inge Li Gørtz
Keywords: Straight line program; SLP; Compressed; Subsequence matching; Tree coloring; First colored ancestor
Journal: Algorithmica
Publication year: 2017
Publication date: February 2017
Volume: 77
Issue: 2
Pages: 336-348
Journal category: Computer Science
Journal subjects: Algorithm Analysis and Problem Complexity; Theory of Computation; Mathematics of Computing; Algorithms; Computer Systems Organization and Communication Networks; Data Structures, Cryptology and Inform
Publisher: Springer US
ISSN: 1432-0541

Abstract

We present a new algorithm for subsequence matching in grammar-compressed strings. Given a grammar of size n compressing a string of size N and a pattern string of size m over an alphabet of size \(\sigma\), our algorithm uses \(O(n+\frac{n\sigma}{w})\) space and either \(O(n+\frac{n\sigma}{w}+m\log N\log w\cdot occ)\) or \(O(n+\frac{n\sigma}{w}\log w+m\log N\cdot occ)\) time. Here w is the word size and occ is the number of minimal occurrences of the pattern. Our algorithm uses less space than previous algorithms and is also faster when \(occ=o(\frac{n}{\log N})\). The algorithm relies on a new data structure that efficiently finds the next occurrence of a given character after a given position in a compressed string. This data structure is in turn based on a new data structure for the tree color problem, where the node colors are packed in bit strings.
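
To make the terms "minimal occurrence" and "next occurrence of a given character" concrete, the following is a minimal sketch on an uncompressed string. It is not the compressed algorithm of the paper: it assumes the text is available in plain form and answers next and previous occurrence queries by binary search over per-character position lists. All function names are hypothetical and chosen for illustration only. A window [begin, end] is reported when the pattern is a subsequence of text[begin..end] but of no proper substring of that window.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


def build_positions(text):
    """Map each character to the sorted list of indices where it occurs."""
    positions = defaultdict(list)
    for i, ch in enumerate(text):
        positions[ch].append(i)
    return positions


def next_occurrence(positions, ch, i):
    """Smallest index j >= i with text[j] == ch, or None if there is none."""
    lst = positions.get(ch, [])
    k = bisect_left(lst, i)
    return lst[k] if k < len(lst) else None


def prev_occurrence(positions, ch, i):
    """Largest index j <= i with text[j] == ch, or None if there is none."""
    lst = positions.get(ch, [])
    k = bisect_right(lst, i)
    return lst[k - 1] if k > 0 else None


def minimal_occurrences(text, pattern):
    """Return all minimal occurrences of pattern as (begin, end) index pairs."""
    if not pattern:
        return []
    positions = build_positions(text)
    result = []
    start = 0
    while True:
        # Forward pass: earliest end such that pattern fits in text[start..end].
        pos = start
        end = None
        for ch in pattern:
            j = next_occurrence(positions, ch, pos)
            if j is None:
                return result  # no further occurrence exists
            end = j
            pos = j + 1
        # Backward pass: latest begin such that pattern fits in text[begin..end].
        # It cannot fail, because the forward pass already matched inside the window.
        pos = end
        begin = None
        for ch in reversed(pattern):
            j = prev_occurrence(positions, ch, pos)
            begin = j
            pos = j - 1
        result.append((begin, end))
        # Every later minimal occurrence must begin strictly after this one.
        start = begin + 1


print(minimal_occurrences("abcabc", "ac"))  # [(0, 2), (3, 5)]
print(minimal_occurrences("aabcbc", "abc"))  # [(1, 3)]
```

Each reported occurrence costs about 2m character queries in this sketch, which mirrors the factor of m per occurrence in the stated time bounds; the contribution described in the abstract lies in answering such queries directly on the grammar rather than on the decompressed text.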
