Practical methods for constructing suffix trees

Hankins, Richard A.; Tata, Sandeep; Patel, Jignesh M.; Tian, Yuanyuan

2005-09

Citation

Tian, Yuanyuan; Tata, Sandeep; Hankins, Richard A.; Patel, Jignesh M.; (2005). "Practical methods for constructing suffix trees." The VLDB Journal 14(3): 281-299. <http://hdl.handle.net/2027.42/47869>

Abstract

Sequence datasets are ubiquitous in modern life-science applications, and querying sequences is a common and critical operation in many of these applications. The suffix tree is a versatile data structure that can be used to evaluate a wide variety of queries on sequence datasets, including evaluating exact and approximate string matches, and finding repeat patterns. However, methods for constructing suffix trees are often very time-consuming, especially for suffix trees that are large and do not fit in the available main memory. Even when the suffix tree fits in memory, it turns out that the processor cache behavior of theoretically optimal suffix tree construction methods is poor, resulting in poor performance. Currently, there are a large number of algorithms for constructing suffix trees, but the practical tradeoffs in using these algorithms for different scenarios are not well characterized.
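As a rough illustration of the query types the abstract mentions (exact matching and finding repeats), the sketch below builds an uncompressed suffix trie with a naive quadratic-time loop. This is not one of the construction methods evaluated in the article; the function names (`build_suffix_trie`, `contains`, `count_occurrences`) and the `$` terminator are assumptions made purely for illustration, and a real suffix tree would compress single-child paths and use a far more efficient construction.

```python
def build_suffix_trie(text, terminator="$"):
    """Insert every suffix of `text` into a nested-dict trie (naive, O(n^2))."""
    if terminator in text:
        raise ValueError("terminator must not occur in text")
    text += terminator  # unique end marker so every suffix ends at its own leaf
    root = {}
    for start in range(len(text)):
        node = root
        for ch in text[start:]:
            node = node.setdefault(ch, {})
    return root


def contains(root, pattern):
    """Exact match: `pattern` occurs in the text iff it labels a path from the root."""
    node = root
    for ch in pattern:
        if ch not in node:
            return False
        node = node[ch]
    return True


def count_occurrences(root, pattern):
    """Count occurrences of `pattern` as the number of leaves below its path."""
    node = root
    for ch in pattern:
        if ch not in node:
            return 0
        node = node[ch]
    leaves = 0
    stack = [node]
    while stack:
        current = stack.pop()
        if not current:          # empty dict: a leaf, i.e. the end of one suffix
            leaves += 1
        else:
            stack.extend(current.values())
    return leaves


trie = build_suffix_trie("banana")
print(contains(trie, "nan"))           # exact-match query
print(count_occurrences(trie, "ana"))  # a count above 1 indicates a repeated pattern
```

A pattern whose occurrence count exceeds one is a repeat, which is how the same structure serves both query types. The memory this naive version uses grows quadratically with the text length, which hints at why construction cost and memory footprint are the practical concerns the abstract raises.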

Publisher

Springer-Verlag

ISSN

0949-877X

1066-8888

Other DOIs

http://dx.doi.org/10.1007/s00778-005-0154-8

Types

Article

Handle

https://hdl.handle.net/2027.42/47869
