An online approach for DNA sequencing error correction via disk based index
dc.contributor.author | Gu, Y | |
dc.contributor.author | Zhu, Q | |
dc.contributor.author | Pramanik, S | |
dc.date.accessioned | 2024-09-27T01:23:36Z | |
dc.date.available | 2024-09-27T01:23:36Z | |
dc.date.issued | 2018-01-01 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/195087 | |
dc.description.abstract | DNA sequencing has been widely used in many biological studies such as gene expression analysis and biomedical diagnostics. To solve the problem of sequencing errors produced by a sequencer, researchers perform error correction as a first step in sequence analysis. To overcome the scalability problem of existing memory based error correction methods, a disk based sequencing error correction method, called the DiskBQcor, was recently proposed. It utilizes a disk based index tree to store k-mers from sequencing reads. It then analyzes the results of special box queries run on the index tree to efficiently correct sequencing errors. As an offline approach, the DiskBQcor corrects errors after all the k-mers for a given genome dataset are inserted into the index tree. In this paper, we present an online approach, called the DiskBQcor ∗ , for sequencing error correction to better utilize computing resources. It extends the DiskBQcor by introducing an online analysis process during which sequencing errors are identified and corrected in an online fashion. The relevant correction algorithms, statistical measures, and error identifying strategies are discussed. Our experiments demonstrate that the proposed online method is quite promising in error correction for sequencing genome data on disk. | |
dc.title | An online approach for DNA sequencing error correction via disk based index | |
dc.type | Conference Paper | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/195087/2/SEDE2018_pub.pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/24326 | |
dc.identifier.source | 27th International Conference on Software Engineering and Data Engineering, SEDE 2018 | |
dc.description.version | Published version | |
dc.date.updated | 2024-09-27T01:23:35Z | |
dc.identifier.startpage | 32 | |
dc.identifier.endpage | 38 | |
dc.identifier.name-orcid | Gu, Y | |
dc.identifier.name-orcid | Zhu, Q | |
dc.identifier.name-orcid | Pramanik, S | |
dc.working.doi | 10.7302/24326 | en |
dc.owningcollname | Computer and Information Science, Department of (UM-Dearborn) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.