Show simple item record

Statistical Methods for Low-frequency and Rare Genetic Variants.

dc.contributor.authorMa, Clementen_US
dc.date.accessioned2015-01-30T20:12:11Z
dc.date.availableNO_RESTRICTIONen_US
dc.date.available2015-01-30T20:12:11Z
dc.date.issued2014en_US
dc.date.submitteden_US
dc.identifier.urihttps://hdl.handle.net/2027.42/110435
dc.description.abstractGenetic association studies using sequencing, dense-array genotyping, or sequencing-based imputation provide the means to identify low-frequency and rare variants associated with diseases and traits, but analysis of these variants presents new statistical challenges. Single marker tests (e.g. logistic and linear regression), and methods to combine information across studies (e.g. joint and meta-analysis) may be poorly calibrated and/or of low power. The calibration and power of aggregation tests, where multiple rare variants are analyzed jointly, have not been evaluated for variants on the X chromosome. In my dissertation, I address three topics: First, for case-control studies, I evaluate the calibration and power of four logistic regression tests in joint and meta-analysis for low-frequency and rare variants and demonstrate that: (a) for joint analysis, the Firth bias-corrected test is best (e.g. most powerful among well-calibrated tests); (b) for meta-analysis of balanced studies (equal numbers of cases and controls), the score test is best, but is less powerful than Firth test-based joint analysis; and (c) for meta-analysis of sufficiently unbalanced studies, all four tests can be anti-conservative, particularly the score test. Second, for quantitative trait (QT) studies, I evaluate the calibration and power of linear regression in joint and meta-analysis and demonstrate for normally distributed QTs that: joint and sample-size weighted meta-analysis are equally well-calibrated and powerful for variants with expected minor allele count E[MAC]≥10; inverse-variance weighted meta-analysis is slightly anti-conservative for small-sized studies. For non-normally distributed QTs, joint and meta-analysis is equally anti-conservative for low-frequency and rare variants. Inverse-normal transformation of the QT remedies this problem, but transforming QTs of any distribution reduces power. Third, for case-control and QT studies, I evaluate the calibration and power of three aggregation tests for the X chromosome: burden, SKAT, and SKAT-O. For case-control studies, tests are relatively well-calibrated across all simulation scenarios. Power is usually slightly increased when the coding scheme for male genotypes matches the underlying model, but power loss is small when the model is misspecified. Differences in male:female ratio in cases and controls have little effect on power. For QTs, calibration and power results are very similar to those for binary traits.en_US
dc.language.isoen_USen_US
dc.subjectStatistical geneticsen_US
dc.titleStatistical Methods for Low-frequency and Rare Genetic Variants.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBiostatisticsen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.contributor.committeememberScott, Lauraen_US
dc.contributor.committeememberBoehnke, Michael Leeen_US
dc.contributor.committeememberWiller, Cristen J.en_US
dc.contributor.committeememberAbecasis, Goncaloen_US
dc.contributor.committeememberSong, Peter Xuekunen_US
dc.contributor.committeememberKang, Hyun Minen_US
dc.contributor.committeememberLee, Seunggeunen_US
dc.subject.hlbsecondlevelGeneticsen_US
dc.subject.hlbsecondlevelPublic Healthen_US
dc.subject.hlbsecondlevelStatistics and Numeric Dataen_US
dc.subject.hlbtoplevelHealth Sciencesen_US
dc.subject.hlbtoplevelScienceen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/110435/1/maclemen_1.pdf
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.