Show simple item record

Statistical Issues in the Analysis of Correlated Data.

dc.contributor.authorXia, Rongen_US
dc.date.accessioned2016-01-13T18:04:38Z
dc.date.availableNO_RESTRICTIONen_US
dc.date.available2016-01-13T18:04:38Z
dc.date.issued2015en_US
dc.date.submitteden_US
dc.identifier.urihttps://hdl.handle.net/2027.42/116687
dc.description.abstractWe first extend the original classification and regression trees (CART) paradigm (Breiman et al. 1984) to clustered binary outcomes, where individuals within a cluster are correlated. We propose to generate tree models using residuals from a null generalized linear mixed model (with fixed and random intercepts only) as the outcome, which circumvents modeling the correlation structure explicitly while still accounting for the cluster-correlated design, thereby allowing us to adopt the original CART machinery in tree growing, pruning and cross-validation. Based on simulations, we find that our residual-based tree is more appropriate for analyzing clustered binary data, and provides more accurate classification predictions than the standard CART that ignores the clustering. We also illustrated our method using data from clinical studies, and residual-based trees identified clinically meaningful subgroups. Clinical attachment level (CAL) is a tooth-level measure that quantifies the severity of periodontal disease. The within-mouth correlation of tooth-level CAL is difficult to model because it must reflect the three-dimensional spatial geography of teeth and their functional similarity. In the second project, we propose two linear mixed effects (LME) models with random effects that quantify the within-mouth correlation of teeth and their shared functionality. Via simulations, we demonstrate that our mixed models give consistent and more efficient estimates than a t-test and generalized estimating equations that fail to model the within-mouth correlation accurately. We also evaluate the performance of the approaches when data are missing under different biologically plausible missing data mechanisms. Inference for the fixed effects in an LME model is dependent upon the correlation structure implied by the random effects included in the LME model. However, limited methods are available for making inference about the fit of the assumed covariance structure in the LME model. In the third project, we propose three permutation tests, all of which are based on comparing the estimated assumed covariance matrix to the covariance matrix of the marginal residuals. Cholesky residuals, which are exchangeable both within and among subjects, are employed in the permutations. Through simulations, we show that two of our tests have valid size and comparable power in testing different covariance structure assumptions.en_US
dc.language.isoen_USen_US
dc.subjectcorrelated dataen_US
dc.subjecttree-based methodsen_US
dc.subjectresidualsen_US
dc.subjectlinear mixed effects modelsen_US
dc.subjectrandom effectsen_US
dc.subjectcovariance model selectionen_US
dc.titleStatistical Issues in the Analysis of Correlated Data.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBiostatisticsen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.contributor.committeememberBraun, Thomas Men_US
dc.contributor.committeememberBanerjee, Mousumien_US
dc.contributor.committeememberFriese, Christopher Ryanen_US
dc.contributor.committeememberSanchez, Brisa Nen_US
dc.subject.hlbsecondlevelPublic Healthen_US
dc.subject.hlbtoplevelHealth Sciencesen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/116687/1/rongxia_1.pdf
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.