Show simple item record

Markov Chain Monte Carlo in Genetics: Subphenotyping, Linkage Disequilibrium Modeling, and Fine Mapping.

dc.contributor.authorGeng, Ziqianen_US
dc.date.accessioned2014-10-13T18:19:42Z
dc.date.availableNO_RESTRICTIONen_US
dc.date.available2014-10-13T18:19:42Z
dc.date.issued2014en_US
dc.date.submitteden_US
dc.identifier.urihttps://hdl.handle.net/2027.42/108872
dc.description.abstractThe advance of modern genotyping and sequencing technologies makes large scale data available in different genetic studies. Meanwhile, MCMC algorithm provides powerful computational tools in handling these high-dimensional genetic data. In this dissertation, I demonstrate several MCMC applications in emerging genetic studies. In Chapter 2, I propose a method to identify genetically homogeneous subphenotypes of complex diseases. I assume that different disease subtypes, caused by different risk variants, behave uniquely in clinical characteristics (treated as covariates). I design an algorithm to identify these covariates to define genetically homogeneous subtypes. Conditional on these covariates, this algorithm calculate each affected individual’s posterior probability of belonging to each subtype. Using simulated data, I illustrate that my algorithm correctly identifies subtypes, such that affected individuals within each subtype group are likely to carry the same risk variants. I also evaluate whether stratifying on these estimated subtype memberships improves the power to detect phenotypic association at risk loci attributable to these subtypes. In Chapter 3, I introduce a novel algorithm to model the linkage disequilibrium (LD) between different genomic positions through shared genealogies. Compared to traditional hidden Markov models (HMM) which might over simplify the evolutionary process of sampled haplotypes, my method allows for more variations in prior probabilities about shared haplotype segments descend from particular ancestors, as well as more variations in population genetic parameters. Through this more careful model, our method improves the accuracy in haplotype reconstruction. Moreover, I propose a fine mapping algorithm based on this model to localize complex trait loci. My algorithm identifies disease causal loci accurately when traditional mapping approaches based on single marker tests have low power. In Chapter 4, I propose an approach to overcome the computational burden in fine mappings using our coalescent-based modeling. I first estimate a set of clusters of sampled haplotypes such that members within each cluster share one common ancestor. I then make inferences about genealogies of these clusters to localize candidate regions of disease-causing mutations. Using simulated data, I illustrate that this implementation enables my fine mapping approach in large samples with several tens of thousands of individuals.en_US
dc.language.isoen_USen_US
dc.subjectStatistical Geneticsen_US
dc.subjectPopulation Geneticsen_US
dc.subjectMorkov Chain Monte Carloen_US
dc.titleMarkov Chain Monte Carlo in Genetics: Subphenotyping, Linkage Disequilibrium Modeling, and Fine Mapping.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBiostatisticsen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.contributor.committeememberZoellner, Sebastian K.en_US
dc.contributor.committeememberBurmeister, Margiten_US
dc.contributor.committeememberBoehnke, Michael Leeen_US
dc.contributor.committeememberJohnson, Timothy D.en_US
dc.contributor.committeememberWen, Xiaoquan Williamen_US
dc.subject.hlbsecondlevelStatistics and Numeric Dataen_US
dc.subject.hlbtoplevelScienceen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/108872/1/zgeng_1.pdf
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.