Show simple item record

Integrative Statistical Learning with Applications to Predicting Features of Diseases and Health.

dc.contributor.authorHuang, Yongshengen_US
dc.date.accessioned2011-06-10T18:14:44Z
dc.date.availableNO_RESTRICTIONen_US
dc.date.available2011-06-10T18:14:44Z
dc.date.issued2011en_US
dc.date.submitteden_US
dc.identifier.urihttps://hdl.handle.net/2027.42/84435
dc.description.abstractThis dissertation develops methods of integrative statistical learning to studies of two human diseases - respiratory infectious diseases and leukemia. It concerns integrating statistically principled approaches to connect data with knowledge for improved understanding of diseases. A wide spectrum of temporal and high-dimensional biological and medical datasets were considered. The first question studied in this thesis examined host responses to viral insult. In a human challenge study, eight transcriptional response patterns were identified in hosts experimentally challenged with influenza H3N2/Wisconsin viruses. These patterns are highly correlated with and predictive of symptoms. A non-passive asymptomatic state was revealed and associated with subclinical infections. The findings were validated and extended to three additional viral pathogens (influenza H1N1, Rhinovirus, and RSV). Their differences and similarities were compared and contrasted. Statistical models were developed for exposure detection and risk stratification. Experimental validations have been performed by collaborators at the Duke University. The second question studied in this thesis investigated the regulatory roles of Hoxa9 and Meis1 in hematopoiesis and leukemia. Methods were developed to characterize their global in vivo binding patterns and to identify their functional cofactors and collaborators. The combinatorial effects of these factors were modeled and related to specific epigenetic signatures. A new biological model was proposed to explain their synergistic functions in leukemic transformation. Experimental validations have been performed by members of the Hess laboratory. Motivated by problems encountered in these studies, two algorithms were developed to identify spatial and temporal patterns from high-throughput data. The first method determines temporal relationships between gene pathways during disease progression. It performs spectral analysis on graph Laplacian-embedded significance measures of pathway activity. The second algorithm proposes probabilistic modeling of protein binding events. Based on information geometry theory, it applies hypothesis testing coupled with jackknife-bias correction to characterize protein-protein interactions. Experimental validations were shown for both algorithms. In conclusion, this dissertation addressed issues in the design of statistical methods to identify characteristic and predictive features of human diseases. It demonstrated the effectiveness of integrating simple techniques in bioinformatics analysis. Several bioinformatics tools were developed to facilitate the analysis of high-dimensional time-series datasets.en_US
dc.language.isoen_USen_US
dc.subjectIntegrative Statistical Learning in High-dimensional Time-series Dataen_US
dc.subjectHost Transcriptional Responses to Respiratory Viral Pathogensen_US
dc.subjectRole of Hoxa9 in Leukemic Transformationen_US
dc.subjectSpectral Analysis of Temporal Pathway Activity Using Graph Lapalacianen_US
dc.subjectInformation Geometric Analysis of Motif Profiles in ChIP-sequencingen_US
dc.subjectPredictive Modeling and Classification in High-dimensional and Temporal Dataen_US
dc.titleIntegrative Statistical Learning with Applications to Predicting Features of Diseases and Health.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineBioinformaticsen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.contributor.committeememberHero Iii, Alfred O.en_US
dc.contributor.committeememberHess, Jay L.en_US
dc.contributor.committeememberBurns Jr., Daniel M.en_US
dc.contributor.committeememberOmenn, Gilbert S.en_US
dc.contributor.committeememberShedden, Kerby A.en_US
dc.subject.hlbsecondlevelBiomedical Engineeringen_US
dc.subject.hlbsecondlevelGeneticsen_US
dc.subject.hlbsecondlevelMicrobiology and Immunologyen_US
dc.subject.hlbsecondlevelPathologyen_US
dc.subject.hlbsecondlevelScience (General)en_US
dc.subject.hlbsecondlevelStatistics and Numeric Dataen_US
dc.subject.hlbtoplevelHealth Sciencesen_US
dc.subject.hlbtoplevelScienceen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/84435/1/huangys_1.pdf
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.