Fitting marginalized two‐part models to semicontinuous survey data arising from complex samples
dc.contributor.author | Smith, Valerie A. | |
dc.contributor.author | West, Brady T. | |
dc.contributor.author | Zhang, Shiyu | |
dc.date.accessioned | 2021-06-02T21:08:26Z | |
dc.date.available | 2022-07-02 17:08:24 | en |
dc.date.available | 2021-06-02T21:08:26Z | |
dc.date.issued | 2021-06 | |
dc.identifier.citation | Smith, Valerie A.; West, Brady T.; Zhang, Shiyu (2021). "Fitting marginalized two‐part models to semicontinuous survey data arising from complex samples." Health Services Research (3): 558-563. | |
dc.identifier.issn | 0017-9124 | |
dc.identifier.issn | 1475-6773 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/167816 | |
dc.description.abstract | ObjectiveTo accurately model semicontinuous data from complex surveys, we extend marginalized two‐part models to a design‐based inferential framework and provide guidance on incorporating complex sample designs.Data Sources2014 Medical Expenditure Panel Survey (MEPS).Study DesignWe describe the use of pseudo‐Maximum Likelihood Estimation and Jackknife Repeated Replication for estimating model parameters and sampling variance, respectively. We illustrate our approach using MEPS, modeling total healthcare expenditures in 2014 as a function of respondents’ age and family income. We provide SAS and R code for implementing the extension, assessing model‐fit indices, and evaluating the need to incorporate complex sampling features.Data Extraction MethodsData obtained from www.meps.ahrq.gov.Principle FindingsA 100 percentage‐point increase in family income as a percent of the federal poverty level was associated with a 5%‐6% increase in healthcare spending. People over 65 had an increase of 4‐5 times compared to those younger. Accounting for complex sampling in the models led to different parameter estimates and wider confidence intervals than the unweighted models. Ignoring complex sampling could lead to inaccurate finite population inference.ConclusionResearchers should account for complex sampling features when analyzing semicontinuous data from surveys. | |
dc.publisher | Chapman and Hall/CRC | |
dc.publisher | Wiley Periodicals, Inc. | |
dc.subject.other | marginalized two‐part models | |
dc.subject.other | healthcare expenditures | |
dc.subject.other | complex sample survey data | |
dc.title | Fitting marginalized two‐part models to semicontinuous survey data arising from complex samples | |
dc.type | Article | |
dc.rights.robots | IndexNoFollow | |
dc.subject.hlbsecondlevel | Public Health | |
dc.subject.hlbtoplevel | Health Sciences | |
dc.description.peerreviewed | Peer Reviewed | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/167816/1/hesr13648-sup-0001-Authormatrix.pdf | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/167816/2/hesr13648.pdf | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/167816/3/hesr13648_am.pdf | |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/167816/4/hesr13648-sup-0002-Supplement.pdf | |
dc.identifier.doi | 10.1111/1475-6773.13648 | |
dc.identifier.source | Health Services Research | |
dc.identifier.citedreference | Olsen MK, Schafer JL. A two‐part random‐effects model for semicontinuous longitudinal data. J Am Statist Assoc. 2001; 96 ( 454 ): 730 ‐ 745. | |
dc.identifier.citedreference | Liu L, Strawderman RL, Johnson BA, O’Quigley JM. Analyzing repeated measures semicontinuous data, with application to an alcohol dependence study. Stat Methods Med Res. 2016; 25 ( 1 ): 133 ‐ 152. | |
dc.identifier.citedreference | Su L, Tom BDM, Farewell VT. Bias in 2‐part mixed models for longitudinal semicontinuous data. Biostatistics. 2009; 10 ( 2 ): 374 ‐ 389. | |
dc.identifier.citedreference | Burgette JM, Preisser JS, Weinberger M, King RS, Lee JY, Rozier RG. Enrollment in early head start and oral health‐related quality of life. Qual Life Res. 2017; 26 ( 10 ): 2607 ‐ 2618. | |
dc.identifier.citedreference | Maciejewski ML, Livingston EH, Smith VA, Kahwati LC, Henderson WG, Arterburn DE. Health expenditures among high‐risk patients after gastric bypass and matched controls. Arch Surg. 2012; 147 ( 7 ): 633 ‐ 640. | |
dc.identifier.citedreference | Smith VA, Maciejewski ML, Olsen MK. Modeling semicontinuous longitudinal expenditures: a practical guide. Health Serv Res. 2018; 53: 3125 ‐ 3147. | |
dc.identifier.citedreference | Cragg JG. Some statistical models for limited dependent variables with application to the demand for durable goods. Econometrica. 1971; 39 ( 5 ): 829 ‐ 844. | |
dc.identifier.citedreference | Manning WG, Morris CN, Newhouse JP, et al. A two‐part model of the demand for medical care: preliminary results from the health insurance study. Health, economics, and health economics. 1981; 103 ‐ 123. | |
dc.identifier.citedreference | Duan N, Manning WG Jr, Morris CN, Newhouse JP. A comparison of alternative models for the demand of medical care. J Bus Econ Stat. 1983; 1 ( 2 ): 115 ‐ 126. | |
dc.identifier.citedreference | Neelon B, O’Malley AJ, Smith VA. Modeling zero‐modified count and semicontinuous data in health services research part 2: case studies. Stat Med. 2016; 35 ( 27 ): 5094 ‐ 5112. | |
dc.identifier.citedreference | Smith VA, Preisser JS, Neelon B, Maciejewski ML. A marginalized two‐part model for semicontinuous data. Stat Med. 2014; 33 ( 28 ): 4891 ‐ 4903. | |
dc.identifier.citedreference | Smith VA, Neelon B, Maciejewski ML, Preisser JS. Two parts are better than one: modeling marginal means of semicontinuous data. Health Serv Outcomes Res Method. 2017; 17 ( 3–4 ): 198 ‐ 218. | |
dc.identifier.citedreference | Smith VA, Neelon B, Preisser JS, Maciejewski ML. A marginalized two‐part model for longitudinal semicontinuous data. Stat Methods Med Res. 2017; 26 ( 4 ): 1949 ‐ 1968. | |
dc.identifier.citedreference | Heeringa SG, West BT, Berglund PA. Applied Survey Data Analysis. Boca Raton, FL: Chapman and Hall/CRC; 2017. | |
dc.identifier.citedreference | West BT, Sakshaug JW, Aurelien GAS. How big of a problem is analytic error in secondary analyses of survey data? PLoS One. 2016; 11 ( 6 ): e0158120. | |
dc.identifier.citedreference | Cohen JW, Cohen SB, Banthin JS. The medical expenditure panel survey: a national information resource to support healthcare cost research and inform policy and practice. Med Care. 2009; 47 ( 7 ): S44 ‐ S50. | |
dc.identifier.citedreference | Smith VA, Preisser JS. Direct and flexible marginal inference for semicontinuous data. Stat Meth Med Res. 2017; 26 ( 6 ): 2962 ‐ 2965. | |
dc.identifier.citedreference | Smith VA, Preisser JS. A marginalized two‐part model with heterogeneous variance for semicontinuous data. Stat Methods Med Res. 2019; 28 ( 5 ): 1412 ‐ 1426. | |
dc.identifier.citedreference | Azzalini A. A class of distributions which includes the normal ones. Scand J Stat. 1985; 12: 171 ‐ 178. | |
dc.identifier.citedreference | Chai HS, Bailey KR. Use of log‐skew‐normal distribution in analysis of continuous data with a discrete component at zero. Stat Med. 2008; 27 ( 18 ): 3643 ‐ 3655. | |
dc.identifier.citedreference | Manning WG, Basu A, Mullahy J. Generalized modeling approaches to risk adjustment of skewed outcomes data. J Health Econ. 2005; 24: 465 ‐ 488. | |
dc.identifier.citedreference | Liu L, Cowen ME, Strawderman RL, Shih YCT. A flexible two‐part random effects model for correlated medical costs. J Health Econ. 2010; 29 ( 1 ): 110 ‐ 123. | |
dc.identifier.citedreference | Binder DA. On the variances of asymptotically normal estimators from complex surveys. Int Stat Rev/Revue Internationale de Statistique. 1983; 51 ( 3 ): 279 ‐ 292. | |
dc.identifier.citedreference | R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Core Team; 2019. | |
dc.identifier.citedreference | Lumley T, Scott A. AIC and BIC for modeling with complex survey data. J Survey Stat Methodol. 2015; 3 ( 1 ): 1 ‐ 18. | |
dc.identifier.citedreference | Bollen KA, Biemer PP, Karr AF, Tueller S, Berzofsky ME. Are survey weights needed? A review of diagnostic tests in regression analysis. Ann Rev Stat Appl. 2016; 3: 375 ‐ 392. | |
dc.identifier.citedreference | Valliant R, Dever J. Survey Weights: A Step‐by‐Step Guide to Calculation. College Station, TX: Stata Press; 2018. | |
dc.identifier.citedreference | Pfeffermann D. Modelling of complex survey data: why model? why is it a problem? How can we approach it. Survey Methodol. 2011; 37 ( 2 ): 115 ‐ 136. | |
dc.working.doi | NO | en |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.