Show simple item record

Oculus: faster sequence alignment by streaming read compression

dc.contributor.authorVeeneman, Brendan A
dc.contributor.authorIyer, Matthew K
dc.contributor.authorChinnaiyan, Arul M
dc.date.accessioned2015-08-07T17:39:09Z
dc.date.available2015-08-07T17:39:09Z
dc.date.issued2012-11-13
dc.identifier.citationBMC Bioinformatics. 2012 Nov 13;13(1):297
dc.identifier.urihttps://hdl.handle.net/2027.42/112673en_US
dc.description.abstractAbstract Background Despite significant advancement in alignment algorithms, the exponential growth of nucleotide sequencing throughput threatens to outpace bioinformatic analysis. Computation may become the bottleneck of genome analysis if growing alignment costs are not mitigated by further improvement in algorithms. Much gain has been gleaned from indexing and compressing alignment databases, but many widely used alignment tools process input reads sequentially and are oblivious to any underlying redundancy in the reads themselves. Results Here we present Oculus, a software package that attaches to standard aligners and exploits read redundancy by performing streaming compression, alignment, and decompression of input sequences. This nearly lossless process (> 99.9%) led to alignment speedups of up to 270% across a variety of data sets, while requiring a modest amount of memory. We expect that streaming read compressors such as Oculus could become a standard addition to existing RNA-Seq and ChIP-Seq alignment pipelines, and potentially other applications in the future as throughput increases. Conclusions Oculus efficiently condenses redundant input reads and wraps existing aligners to provide nearly identical SAM output in a fraction of the aligner runtime. It includes a number of useful features, such as tunable performance and fidelity options, compatibility with FASTA or FASTQ files, and adherence to the SAM format. The platform-independent C++ source code is freely available online, at http://code.google.com/p/oculus-bio .
dc.titleOculus: faster sequence alignment by streaming read compression
dc.typeArticleen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/112673/1/12859_2012_Article_5548.pdf
dc.identifier.doi10.1186/1471-2105-13-297en_US
dc.language.rfc3066en
dc.rights.holderVeeneman et al.; licensee BioMed Central Ltd.
dc.date.updated2015-08-07T17:39:09Z
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.