Development and Application of Methods to Discover Cancer-Associated Transcript Variants.
dc.contributor.author | Veeneman, Brendan | |
dc.date.accessioned | 2017-01-26T22:20:05Z | |
dc.date.available | NO_RESTRICTION | |
dc.date.available | 2017-01-26T22:20:05Z | |
dc.date.issued | 2016 | |
dc.date.submitted | 2016 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/135887 | |
dc.description.abstract | Cancer is and has long been a major threat to human health, and in seeking to better treat cancer, we seek first to better understand cancer. Consequently, the current era of cancer research has aimed to catalog the full range of molecular abnormalities in cancer's genome, epigenome, transcriptome, and proteome. Next-generation short read sequencing has empowered these cataloging efforts, but requires sophisticated algorithms to fully harness, particularly in the case of splicing and transcript variation. The aim of this dissertation was to address this need by establishing and applying novel methods to analyze RNA sequencing data in cancer. In chapter one, we present Oculus, a software package that attaches to standard aligners and exploits read redundancy by performing streaming compression, alignment, and decompression of input sequences. This nearly lossless process (> 99.9%) led to alignment speedups of up to 270% across a variety of data sets. In chapter two, we profile performance characteristics of two-pass alignment, which separates splice junction discovery from quantification. Across a variety of transcriptome sequencing datasets, two-pass alignment improved quantification of at least 94% of simulated novel splice junctions, and provided as much as 1.7-fold deeper median read depth over those splice junctions. Two-pass alignment promises to advance quantification and discovery of novel splicing events. In chapter three, we present a novel bioinformatics pipeline to analyze splicing and transcript variation from cancer transcriptome data, using splice junction read depth, and correlative analysis to circumvent known biases such as tumor content. We demonstrate the value of this pipeline through application to the oncogenes MET and ALK. Finally, in chapter four, we present the application of our transcript variant calling pipeline to transcriptome data from prostate cancer. We present several recurrently differentially spliced genes which are not attributable to noise or bias and may serve as novel biomarkers, evidence for transcript variants of the androgen receptor, and an apparent genome-wide pattern of alternative transcription start site usage. | |
dc.language.iso | en_US | |
dc.subject | bioinformatics | |
dc.subject | cancer biology | |
dc.subject | sequence alignment | |
dc.subject | RNA splicing | |
dc.subject | transcript variation | |
dc.subject | transcriptomics | |
dc.title | Development and Application of Methods to Discover Cancer-Associated Transcript Variants. | |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Bioinformatics | |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | |
dc.contributor.committeemember | Chinnaiyan, Arul M | |
dc.contributor.committeemember | Nesvizhskii, Alexey | |
dc.contributor.committeemember | Jiang, Hui | |
dc.contributor.committeemember | Mills, Ryan Edward | |
dc.contributor.committeemember | Omenn, Gilbert S | |
dc.contributor.committeemember | Tomlins, Scott Arthur | |
dc.subject.hlbsecondlevel | Genetics | |
dc.subject.hlbsecondlevel | Molecular, Cellular and Developmental Biology | |
dc.subject.hlbsecondlevel | Oncology and Hematology | |
dc.subject.hlbsecondlevel | Pathology | |
dc.subject.hlbsecondlevel | Statistics and Numeric Data | |
dc.subject.hlbtoplevel | Health Sciences | |
dc.subject.hlbtoplevel | Science | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/135887/1/veeneman_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.