Statistical Analysis for Genomic Studies Involving Measurement Error, Multiple Populations, and Limited Sample Size.

Zhang, Juan

Statistical Analysis for Genomic Studies Involving Measurement Error, Multiple Populations, and Limited Sample Size.

dc.contributor.author	Zhang, Juan	en_US
dc.date.accessioned	2013-06-12T14:16:14Z
dc.date.available	NO_RESTRICTION	en_US
dc.date.available	2013-06-12T14:16:14Z
dc.date.issued	2013	en_US
dc.date.submitted	2013	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/97913
dc.description.abstract	Genomic studies involve various types of high-dimensional data. Study designs are often complex, and data are difficult to collect. For example, the subjects may belong to distinct populations, the number of subjects is often small, and substantial measurement error is usually present. In this thesis, we consider three important issues that arise in this research setting. The impact of measurement error on parameter estimation has been extensively studied, but its effects on predictive performance have not been. In part 1 of the thesis, we partially characterize the data generating models that are most adversely impacted by measurement error. These results may help researchers judge whether improving data collection procedures, or identifying more informative markers would have a greater impact on predictive performance. In part 2 of the thesis, we present a new approach for identifying the common and unique marker/outcome associations that are present in a genomic dataset consisting of several subpopulations. We show that the natural plug-in style estimates of overlap are biased, and we demonstrate a copula-based approach to reducing the bias. Part 3 of the thesis considers situations in which power for attributing effects to specific markers is low, but meaningful relationships between marker/outcome associations and other statistical properties of the markers can be identified.	en_US
dc.language.iso	en_US	en_US
dc.subject	Measurement Error	en_US
dc.subject	Effect Size	en_US
dc.title	Statistical Analysis for Genomic Studies Involving Measurement Error, Multiple Populations, and Limited Sample Size.	en_US
dc.type	Thesis	en_US
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Statistics	en_US
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies	en_US
dc.contributor.committeemember	Shedden, Kerby A.	en_US
dc.contributor.committeemember	Jiang, Hui	en_US
dc.contributor.committeemember	Hansen, Ben B.	en_US
dc.contributor.committeemember	Kretzler, Matthias	en_US
dc.subject.hlbsecondlevel	Statistics and Numeric Data	en_US
dc.subject.hlbtoplevel	Science	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/97913/1/zjnankai_1.pdf
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: zjnankai_1.pdf
Size:: 18.15MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.