Robust Methods for Estimating the Mean with Missing Data.
dc.contributor.author | Yang, Ye | en_US |
dc.date.accessioned | 2015-09-30T14:22:55Z | |
dc.date.available | NO_RESTRICTION | en_US |
dc.date.available | 2015-09-30T14:22:55Z | |
dc.date.issued | 2015 | en_US |
dc.date.submitted | en_US | |
dc.identifier.uri | https://hdl.handle.net/2027.42/113402 | |
dc.description.abstract | Missing data are common in many empirical studies. In this dissertation, we explore robust methods to estimate the mean of an outcome variable subject to nonresponse in the presence of fully observed covariates. In Chapter II, we consider data on a continuous outcome that are missing at random and a fully observed set of covariates. Doubly-robust (DR) estimators are consistent when either the regression model for the mean function or the propensity to respond (the “propensity model”) is correctly specified. We compare by simulation a variety of doubly-robust (DR) estimators for estimating the mean of the outcome. Penalized spline of propensity prediction (Zhang and Little 2009) and the augmented estimating equation method proposed in Cao, et al (2009) tended to outperform the other DR methods. In Chapter III, we consider estimating the mean of a continuous outcome that may be missing not at random (MNAR). Bivariate normal pattern-mixture models (BNPM; Little 1994) and proxy-pattern mixture models (PPMA; Andridge and Little 2011) have been proposed to estimate the mean of the outcome under varying assumptions about the missing data mechanism given one or more fully observed covariates. Both BNPM and PPMA assume normality. We propose a spline-proxy pattern mixture model (S-PPMA), which relaxes the normality assumption using a penalized spline. Properties of S-PPMA are assessed by simulation. Results show that S-PPMA yields estimates that are more robust than PPMA to deviations from normality, while trading off precision when normality assumption is met. In Chapter IV, we extend S-PPMA to a binary outcome (binS-PPMA), by assuming an underlying continuous latent variable that generates the binary outcome. We apply binS-PPMA to the latent variable to impute the binary outcome given information from observed covariates and assumptions about the nonresponse mechanism. As with continuous missing variables, binS-PPMA shows improvement in robustness to normality compared to bin-PPMA, with no important differences between the methods when all variables are normally distributed. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | missing data | en_US |
dc.subject | doubly robust | en_US |
dc.title | Robust Methods for Estimating the Mean with Missing Data. | en_US |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Biostatistics | en_US |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | en_US |
dc.contributor.committeemember | Little, Roderick J. | en_US |
dc.contributor.committeemember | Wang, Naisyin | en_US |
dc.contributor.committeemember | Raghunathan, Trivellore E. | en_US |
dc.contributor.committeemember | Elliott, Michael R. | en_US |
dc.subject.hlbsecondlevel | Statistics and Numeric Data | en_US |
dc.subject.hlbtoplevel | Science | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/113402/1/yeya_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.