The Development and Application of a Risk Index to Predict Individualized Chronic Disease Risk.
dc.contributor.author | Kelly, Reagan John | en_US |
dc.date.accessioned | 2010-01-07T16:24:31Z | |
dc.date.available | NO_RESTRICTION | en_US |
dc.date.available | 2010-01-07T16:24:31Z | |
dc.date.issued | 2009 | en_US |
dc.date.submitted | en_US | |
dc.identifier.uri | https://hdl.handle.net/2027.42/64650 | |
dc.description.abstract | Integrating clinical and genetic information to improve clinicians’ ability to estimate an individual’s disease risk is an important biomedical research challenge. This dissertation develops a “risk index” procedure that combines clinical data and genome-wide genotypes to make predictions about individuals’ risk of disease. For a set of 100 simulated datasets containing 1,000 individuals, 8 clinical covariates, 500 Single nucleotide polymorphisms, and an outcome prevalence of 30%, the average area under the receiver operating characteristics (ROC) curve (AUC) for a risk index model built with clinical covariates and SNPs was significantly higher than a model built with clinical covariates alone (0.846 vs. 0.832, p=0.0002). A risk index model that includes the principal components that account for 90% of the variability in the SNPs also significantly increased the average AUC compared to a clinical covariates only model (0.839 vs. 0.826, p=0.008). For a set of 25 simulated datasets containing 10,000 individuals, 29 clinical covariates, and 38,835 SNPs, a significant difference in average AUC was observed between clinical and clinical+genotype models (0.939 vs. 0.926, p=0.001), using the 500 SNPs most highly associated with the outcome. A risk index model including the 500 largest principal components of the 38,835 SNPs did not significantly increase the mean AUC beyond the clinical model (0.931 vs. 0.931, p=0.98). The risk index methodology was then applied to individuals from the Framingham Heart Study using 27 clinical covariates and 48,071 SNPs. Clinical+genotype risk index models built to predict ten-year incident hypertension, ten-year incident diabetes, or prevalent hypertension had AUCs of 0.475, 0.682, and 0.692, respectively, using the 500 SNPs most highly associated with the outcome, and AUCs of 0.563, 0.782, and 0.712, respectively, using the 500 largest principal components of the SNPs. The results from these analyses suggest that the risk index methodology has utility for predicting an individual’s risk of developing a chronic disease, and that the use of principal components of a large set of SNPs in place of a smaller selected set of associated SNPs provides the best predictive performance. | en_US |
dc.format.extent | 16406693 bytes | |
dc.format.extent | 1373 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.language.iso | en_US | en_US |
dc.subject | Bioinformatics | en_US |
dc.subject | Risk Prediction | en_US |
dc.subject | Integrating Clinical and Genetic Data | en_US |
dc.subject | Machine Learning | en_US |
dc.title | The Development and Application of a Risk Index to Predict Individualized Chronic Disease Risk. | en_US |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Bioinformatics | en_US |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | en_US |
dc.contributor.committeemember | Kardia, Sharon | en_US |
dc.contributor.committeemember | Jagadish, Hosagrahar V. | en_US |
dc.contributor.committeemember | Peyser, Patricia A. | en_US |
dc.contributor.committeemember | Shedden, Kerby | en_US |
dc.subject.hlbsecondlevel | Genetics | en_US |
dc.subject.hlbsecondlevel | Statistics and Numeric Data | en_US |
dc.subject.hlbtoplevel | Science | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/64650/1/reagank_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.