Show simple item record

Understanding the accuracy of statistical haplotype inference with sequence data of known phase

dc.contributor.authorAndrés, Aida M.en_US
dc.contributor.authorClark, Andrew G.en_US
dc.contributor.authorShimmin, Lawrenceen_US
dc.contributor.authorBoerwinkle, Ericen_US
dc.contributor.authorSing, Charles F.en_US
dc.contributor.authorHixson, James E.en_US
dc.date.accessioned2007-12-04T18:31:26Z
dc.date.available2008-11-05T15:05:43Zen_US
dc.date.issued2007-11en_US
dc.identifier.citationAndrÉs, Aida M.; Clark, Andrew G.; Shimmin, Lawrence; Boerwinkle, Eric; Sing, Charles F.; Hixson, James E. (2007). "Understanding the accuracy of statistical haplotype inference with sequence data of known phase." Genetic Epidemiology 31(7): 659-671. <http://hdl.handle.net/2027.42/57366>en_US
dc.identifier.issn0741-0395en_US
dc.identifier.issn1098-2272en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/57366
dc.identifier.urihttp://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=17922479&dopt=citation
dc.description.abstractStatistical methods for haplotype inference from multi-site genotypes of unrelated individuals have important application in association studies and population genetics. Understanding the factors that affect the accuracy of this inference is important, but their assessment has been restricted by the limited availability of biological data with known phase. We created hybrid cell lines monosomic for human chromosome 19 and produced single-chromosome complete sequences of a 48 kb genomic region in 39 individuals of African American (AA) and European American (EA) origin. We employ these phase-known genotypes and coalescent simulations to assess the accuracy of statistical haplotype reconstruction by several algorithms. Accuracy of phase inference was considerably low in our biological data even for regions as short as 25–50 kb, suggesting that caution is needed when analyzing reconstructed haplotypes. Moreover, the reliability of estimated confidence in phase inference is not high enough to allow for a reliable incorporation of site-specific uncertainty information in subsequent analyses. We show that, in samples of certain mixed ancestry (AA and EA populations), the most accurate haplotypes are probably obtained when increasing sample size by considering the largest, pooled sample, despite the hypothetical problems associated with pooling across those heterogeneous samples. Strategies to improve confidence in reconstructed haplotypes, and realistic alternatives to the analysis of inferred haplotypes, are discussed. Genet. Epidemiol . © 2007 Wiley-Liss, Inc.en_US
dc.format.extent378416 bytes
dc.format.extent3118 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypetext/plain
dc.publisherWiley Subscription Services, Inc., A Wiley Companyen_US
dc.subject.otherLife and Medical Sciencesen_US
dc.subject.otherGeneticsen_US
dc.titleUnderstanding the accuracy of statistical haplotype inference with sequence data of known phaseen_US
dc.typeArticleen_US
dc.rights.robotsIndexNoFollowen_US
dc.subject.hlbsecondlevelBiological Chemistryen_US
dc.subject.hlbsecondlevelGeneticsen_US
dc.subject.hlbsecondlevelMolecular, Cellular and Developmental Biologyen_US
dc.subject.hlbtoplevelHealth Sciencesen_US
dc.subject.hlbtoplevelScienceen_US
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumDepartment of Human Genetics, University of Michigan, Ann Arbor, Michiganen_US
dc.contributor.affiliationotherDepartment of Molecular Biology and Genetics, Cornell University, Ithaca, New York ; National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50 Room 5527, Bethesda, MD 20892en_US
dc.contributor.affiliationotherDepartment of Molecular Biology and Genetics, Cornell University, Ithaca, New Yorken_US
dc.contributor.affiliationotherHuman Genetics Center, University of Texas Health Science Center, Houston, Texasen_US
dc.contributor.affiliationotherHuman Genetics Center, University of Texas Health Science Center, Houston, Texasen_US
dc.contributor.affiliationotherHuman Genetics Center, University of Texas Health Science Center, Houston, Texasen_US
dc.identifier.pmid17922479
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/57366/1/20185_ftp.pdfen_US
dc.identifier.doihttp://dx.doi.org/10.1002/gepi.20185en_US
dc.identifier.sourceGenetic Epidemiologyen_US
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.