Using Population Mixtures to Optimize the Utility of Genomic Databases: Linkage Disequilibrium and Association Study Design in India
dc.contributor.author | Pemberton, Trevor J. | en_US |
dc.contributor.author | Jakobsson, Mattias | en_US |
dc.contributor.author | Conrad, D. F. | en_US |
dc.contributor.author | Coop, G. | en_US |
dc.contributor.author | Wall, J. D. | en_US |
dc.contributor.author | Pritchard, Jonathan K. | en_US |
dc.contributor.author | Patel, P. I. | en_US |
dc.contributor.author | Rosenberg, Noah A. | en_US |
dc.date.accessioned | 2010-04-01T15:28:06Z | |
dc.date.available | 2010-04-01T15:28:06Z | |
dc.date.issued | 2008-07 | en_US |
dc.identifier.citation | Pemberton, T. J.; Jakobsson, M.; Conrad, D. F.; Coop, G.; Wall, J. D.; Pritchard, J. K.; Patel, P. I.; Rosenberg, N. A. (2008). "Using Population Mixtures to Optimize the Utility of Genomic Databases: Linkage Disequilibrium and Association Study Design in India." Annals of Human Genetics 72(4): 535-546. <http://hdl.handle.net/2027.42/65949> | en_US |
dc.identifier.issn | 0003-4800 | en_US |
dc.identifier.issn | 1469-1809 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/65949 | |
dc.identifier.uri | http://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=18513279&dopt=citation | en_US |
dc.description.abstract | When performing association studies in populations that have not been the focus of large-scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis – such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation. | en_US |
dc.format.extent | 619283 bytes | |
dc.format.extent | 3110 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.publisher | Blackwell Publishing Ltd | en_US |
dc.rights | Journal compilation © 2008 University College London | en_US |
dc.subject.other | Association Mapping | en_US |
dc.subject.other | Bengali | en_US |
dc.subject.other | Indian Population | en_US |
dc.subject.other | Portability | en_US |
dc.subject.other | Single-nucleotide Polymorphism | en_US |
dc.subject.other | Tamil | en_US |
dc.title | Using Population Mixtures to Optimize the Utility of Genomic Databases: Linkage Disequilibrium and Association Study Design in India | en_US |
dc.type | Article | en_US |
dc.rights.robots | IndexNoFollow | en_US |
dc.subject.hlbsecondlevel | Genetics | en_US |
dc.subject.hlbtoplevel | Health Sciences | en_US |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.contributor.affiliationum | Department of Human Genetics and Center for Computational Medicine and Biology, University of Michigan, 100 Washtenaw Ave., Ann Arbor, Michigan 48109 USA | en_US |
dc.contributor.affiliationother | Institute for Genetic Medicine, University of Southern California, 2250 Alcazar St., Los Angeles, California 90033 USA | en_US |
dc.contributor.affiliationother | Department of Human Genetics, University of Chicago, 920 East 58th St., Chicago, Illinois 60637 USA | en_US |
dc.contributor.affiliationother | Department of Epidemiology and Biostatistics, University of California, San Francisco, California 94107 USA | en_US |
dc.identifier.pmid | 18513279 | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/65949/1/j.1469-1809.2008.00457.x.pdf | |
dc.identifier.doi | 10.1111/j.1469-1809.2008.00457.x | en_US |
dc.identifier.source | Annals of Human Genetics | en_US |
dc.identifier.citedreference | Ahmadi, K. R., Weale, M. E., Xue, Z. Y., Soranzo, N., Yarnall, D. P., Briley, J. D., Maruyama, Y., Kobayashi, M., Wood, N. W., Spurr, N. K., Burns, D. K., Roses, A. D., Saunders, A. M. & Goldstein, D. B. ( 2005 ) A single-nucleotide polymorphism tagging set for human drug metabolism and transport. Nat Genet 37, 84 – 89. | en_US |
dc.identifier.citedreference | Andrés, A. M., Clark, A. G., Shimmin, L., Boerwinkle, E., Sing, C. F. & Hixson, J. E. ( 2007 ) Understanding the accuracy of statistical haplotype inference with sequence data of known phase. Genet Epidemiol 31, 659 – 671. | en_US |
dc.identifier.citedreference | Beaty, T. H., Fallin, M. D., Hetmanski, J. B., McIntosh, I., Chong, S. S., Ingersoll, R., Sheng, X., Chakraborty, R. & Scott, A. F. ( 2005 ) Haplotype diversity in 11 candidate genes across four populations. Genetics 171, 259 – 267. | en_US |
dc.identifier.citedreference | Cann, H. M., de Toma, C., Cazes, L., Legrand, M.-F., Morel, V., Piouffre, L., Bodmer, J., Bodmer, W. F., Bonne-Tamir, B., Cambon-Thomsen, A., Chen, Z., Chu, J., Carcassi, C., Contu, L., Du, R., Excoffier, L., Ferrara, G. B., Friedlaender, J. S., Groot, H., Gurwitz, D., Jenkins, T., Herrera, R. J., Huang, X., Kidd, J., Kidd, K. K., Langaney, A., Lin, A. A., Mehdi, S. Q., Parham, P., Piazza, A., Pistillo, M. P., Qian, Y., Shu, Q., Xu, J., Zhu, S., Weber, J. L., Greely, H. T., Feldman, M. W., Thomas, G., Dausset, J. & Cavalli-Sforza, L. L. ( 2002 ) A human genome diversity cell line panel. Science 296, 261 – 262. | en_US |
dc.identifier.citedreference | Carlson, C. S., Eberle, M. A., Rieder, M. J., Yi, Q., Kruglyak, L. & Nickerson, D. A. ( 2004 ) Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet 74, 106 – 120. | en_US |
dc.identifier.citedreference | Cha, P.-C., Yamada, R., Sekine, A., Nakamura, Y. & Koh, C.-L. ( 2004 ) Inference from the relationships between linkage disequilibrium and allele frequency distributions of 240 candidate SNPs in 109 drug-related genes in four Asian populations. J Hum Genet 49, 558 – 572. | en_US |
dc.identifier.citedreference | Conrad, D. F., Jakobsson, M., Coop, G., Wen, X., Wall, J. D., Rosenberg, N. A. & Pritchard, J. K. ( 2006 ) A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet 38, 1251 – 1260. | en_US |
dc.identifier.citedreference | de Bakker, P. I., Graham, R. R., Altshuler, D., Henderson, B. E. & Haiman, C. A. ( 2006a ) Transferability of tag SNPs to capture common genetic variation in DNA repair genes across multiple populations. Pac Symp Biocomput 11, 478 – 486. | en_US |
dc.identifier.citedreference | de Bakker, P. I. W., Burtt, N. P., Graham, R. R., Guiducci, C., Yelensky, R., Drake, J. A., Bersaglieri, T., Penney, K. L., Butler, J., Young, S., Onofrio, R. C., Lyon, H. N., Stram, D. O., Haiman, C. A., Freedman, M. L., Zhu, X., Cooper, R., Groop, L., Kolonel, L. N., Henderson, B. E., Daly, M. J., Hirschhorn, J. N. & Altshuler, D. ( 2006b ) Transferability of tag SNPs in genetic association studies in multiple populations. Nat Genet 38, 1298 – 1303. | en_US |
dc.identifier.citedreference | Gabriel, S. B., Schaffner, S. F., Nguyen, H., Moore, J. M., Roy, J., Blumenstiel, B., Higgins, J., DeFelice, M., Lochner, A., Faggart, M., Liu-Cordero, S. N., Rotimi, C., Adeyemo, A., Cooper, R., Ward, R., Lander, E. S., Daly, M. J. & Altshuler, D. ( 2002 ) The structure of haplotype blocks in the human genome. Science 296, 2225 – 2229. | en_US |
dc.identifier.citedreference | González-Neira, A., Ke, X., Lao, O., Calafell, F., Navarro, A., Comas, D., Cann, H., Bumpstead, S., Ghori, J., Hunt, S., Deloukas, P., Dunham, I., Cardon, L. R. & Bertranpetit, J. ( 2006 ) The portability of tagSNPs across populations: a worldwide survey. Genome Res 16, 323 – 330. | en_US |
dc.identifier.citedreference | Gu, C. C., Yu, K., Ketkar, S., Templeton, A. R. & Rao, D. C. ( 2008 ) On transferability of genome-wide tagSNPs. Genet Epidemiol 32, 89 – 97. | en_US |
dc.identifier.citedreference | Gu, S., Pakstis, A. J., Li, H., Speed, W. C., Kidd, J. R. & Kidd, K. K. ( 2007 ) Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations. Eur J Hum Genet 15, 302 – 312. | en_US |
dc.identifier.citedreference | Hinds, D. A., Stuve, L. L., Nilsen, G. B., Halperin, E., Eskin, E., Ballinger, D. G., Frazer, K. A. & Cox, D. R. ( 2005 ) Whole-genome patterns of common DNA variation in three human populations. Science 307, 1072 – 1079. | en_US |
dc.identifier.citedreference | Howie, B. N., Carlson, C. S., Rieder, M. J. & Nickerson, D. A. ( 2006 ) Efficient selection of tagging single-nucleotide polymorphisms in multiple populations. Hum Genet 120, 58 – 68. | en_US |
dc.identifier.citedreference | Huang, W., He, Y., Wang, H., Wang, Y., Liu, Y., Wang, Y., Chu, X., Wang, Y., Xu, L., Shen, Y., Xiong, X., Li, H., Wen, B., Qian, J., Yuan, W., Zhang, C., Wang, Y., Jiang, H., Zhao, G., Chen, Z. & Jin, L. ( 2006 ) Linkage disequilibrium sharing and haplotype-tagged SNP portability between populations. Proc Nat Acad Sci USA 103, 1418 – 1421. | en_US |
dc.identifier.citedreference | The International HapMap Consortium ( 2005 ) A haplotype map of the human genome. Nature 437, 1299 – 1320. | en_US |
dc.identifier.citedreference | The International HapMap Consortium ( 2007 ) A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851 – 861. | en_US |
dc.identifier.citedreference | Johansson, A., Vavruch-Nilsson, V., Cox, D. R., Frazer, K. A. & Gyllensten, U. ( 2007 ) Evaluation of the SNP tagging approach in an independent population sample: array-based SNP discovery in Sami. Hum Genet 122, 141 – 150. | en_US |
dc.identifier.citedreference | Landwehr, N., Mielikäinen, T., Eronen, L., Toivonen, H. & Mannila, H. ( 2007 ) Constrained hidden Markov models for population-based haplotyping. BMC Bioinformatics 8, S9. | en_US |
dc.identifier.citedreference | Li, X. & Li, J. ( 2007 ) Comparison of haplotyping methods using families and unrelated individuals on simulated rheumatoid arthritis data. BMC Proc 1, S55. | en_US |
dc.identifier.citedreference | Lim, J., Kim, Y. J., Yoon, Y., Kim, S. O., Kang, H., Park, J., Han, A. R., Han, B., Oh, B., Kimm, K., Yoon, B. & Song, K. ( 2006 ) Comparative study of the linkage disequilibrium of an ENCODE region, chromosome 7p15, in Korean, Japanese, and Han Chinese samples. Genomics 87, 392 – 398. | en_US |
dc.identifier.citedreference | Mahasirimongkol, S., Chantratita, W., Promso, S., Pasomsab, E., Jinawath, N., Jongjaroenprasert, W., Lulitanond, V., Krittayapoositpot, P., Tongsima, S., Sawanpanyalert, P., Kamatani, N., Nakamura, Y. & Sura, T. ( 2006 ) Similarity of the allele frequency and linkage disequilibrium pattern of single nucleotide polymorphisms in drug-related gene loci between Thai and northern East Asian populations: implications for tagging SNP selection in Thais. J Hum Genet 51, 896 – 904. | en_US |
dc.identifier.citedreference | Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. ( 2007 ) A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39, 906 – 913. | en_US |
dc.identifier.citedreference | Marvelle, A. F., Lange, L. A., Qin, L., Wang, Y., Lange, E. M., Adair, L. S. & Mohlke, K. L. ( 2007 ) Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples. J Hum Genet 52, 729 – 737. | en_US |
dc.identifier.citedreference | Montpetit, A., Nelis, M., Laflamme, P., Magi, R., Ke, X., Remm, M., Cardon, L., Hudson, T. J. & Metspalu, A. ( 2006 ) An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population. PLoS Genet 2, 282 – 290. | en_US |
dc.identifier.citedreference | Mueller, J. C., Lõhmussaar, E., Mägi, R., Remm, M., Bettecken, T., Lichtner, P., Biskup, S., Illig, T., Pfeufer, A., Luedemann, J., Schreiber, S., Pramstaller, P., Pichler, I., Romeo, G., Gaddi, A., Testa, A., Wichmann, H.-E., Metspalu, A. & Meitinger, T. ( 2005 ) Linkage disequilibrium patterns and tagSNP transferability among European populations. Am J Hum Genet 76, 387 – 398. | en_US |
dc.identifier.citedreference | Nei, M. & Li, W.-H. ( 1973 ) Linkage disequilibrium in subdivided populations. Genetics 75, 213 – 219. | en_US |
dc.identifier.citedreference | Ohta, T. ( 1982 ) Linkage disequilibrium due to random genetic drift in finite subdivided populations. Proc Nat Acad Sci USA 79, 1940 – 1944. | en_US |
dc.identifier.citedreference | Prasad, P. & Thelma, B. K. ( 2007 ) Normative genetic profiles of RAAS pathway gene polymorphisms in North Indian and South Indian populations. Hum Biol 79, 241 – 254. | en_US |
dc.identifier.citedreference | Raj, S. M., Chakraborty, R., Wang, N. & Govindaraju, D. R. ( 2006 ) Linkage disequilibria and haplotype structure of four SNPs of the interleukin 1 gene cluster in seven Asian Indian populations. Hum Biol 78, 109 – 119. | en_US |
dc.identifier.citedreference | Reich, D. E., Cargill, M., Bolk, S., Ireland, J., Sabeti, P. C., Richter, D. J., Lavery, T., Kouyoumjian, R., Farhadian, S. F., Ward, R. & Lander, E. S. ( 2001 ) Linkage disequilibrium in the human genome. Nature 411, 199 – 204. | en_US |
dc.identifier.citedreference | Ribas, G., González-Neira, A., Salas, A., Milne, R. L., Vega, A., Carracedo, B., González, E., Barroso, E., Fernández, L. P., Yankilevich, P., Robledo, M., Carracedo, A. & Benítez, J. ( 2006 ) Evaluating HapMap SNP data transferability in a large-scale genotyping project involving 175 cancer-associated genes. Hum Genet 118, 669 – 679. | en_US |
dc.identifier.citedreference | Roberts, A., McMillan, L., Wang, W., Parker, J., Rusyn, I. & Threadgill, D. ( 2007 ) Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows. Bioinformatics 23, i401-i407. | en_US |
dc.identifier.citedreference | Rosenberg, N. A., Mahajan, S., Gonzalez-Quevedo, C., Blum, M. G.B., Nino-Rosales, L., Ninis, V., Das, P., Hegde, M., Molinari, L., Zapata, G., Weber, J. L., Belmont, J. W. & Patel, P. I. ( 2006 ) Low levels of genetic divergence across geographically and linguistically diverse populations of India. PLoS Genet 2, 2052 – 2061. | en_US |
dc.identifier.citedreference | Roy, N. S., Farheen, S., Roy, N., Sengupta, S. & Majumder, P. P. ( 2008 ) Portability of tag SNPs across isolated population groups: an example from India. Ann Hum Genet 72, 82 – 89. | en_US |
dc.identifier.citedreference | Sawyer, S. L., Mukherjee, N., Pakstis, A. J., Feuk, L., Kidd, J. R., Brookes, A. J. & Kidd, K. K. ( 2005 ) Linkage disequilibrium patterns vary substantially among populations. Eur J Hum Genet 13, 677 – 686. | en_US |
dc.identifier.citedreference | Scheet, P. & Stephens, M. ( 2006 ) A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet 78, 629 – 644. | en_US |
dc.identifier.citedreference | Sengupta, S., Farheen, S., Mukherjee, N., Dey, B., Mukhopadhyay, B., Sil, S. K., Prabhakaran, N., Ramesh, A., Edwin, D., Usha Rani, M. V., Mitra, M., Mahadik, C. T., Singh, S., Sehgal, S. C. & Majumder, P. P. ( 2004 ) DNA sequence variation and haplotype structure of the ICAM1 and TNF genes in 12 ethnic groups of India reveal patterns of importance in designing association studies. Ann Hum Genet 68, 574 – 587. | en_US |
dc.identifier.citedreference | Service, S., The International Collaborative Group on Isolated Populations, Sabatti, C. & Freimer, N. ( 2007 ) Tag SNPs chosen from HapMap perform well in several population isolates. Genet Epidemiol 31, 189 – 194. | en_US |
dc.identifier.citedreference | Servin, B. & Stephens, M. ( 2007 ) Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet 3, 1296 – 1308. | en_US |
dc.identifier.citedreference | Smith, E. M., Wang, X., Littrell, J., Eckert, J., Cole, R., Kissebah, A. H. & Olivier, M. ( 2006 ) Comparison of linkage disequilibrium patterns between the HapMap CEPH samples and a family-based cohort of Northern European descent. Genomics 88, 407 – 414. | en_US |
dc.identifier.citedreference | Stankovich, J., Cox, C. J., Tan, R. B., Montgomery, D. S., Huxtable, S. J., Rubio, J. P., Ehm, M. G., Johnson, L., Butzkueven, H., Kilpatrick, T. J., Speed, T. P., Roses, A. D., Bahlo, M. & Foote, S. J. ( 2006 ) On the utility of data from the International HapMap Project for Australian association studies. Hum Genet 119, 220 – 222. | en_US |
dc.identifier.citedreference | Tang, K., Ngoi, S.-M., Gwee, P.-C., Chua, J. M. Z., Lee, E. J. D., Chong, S. S. & Lee, C. G. L. ( 2002 ) Distinct haplotype profiles and strong linkage disequilibrium at the MDR1 multidrug transporter gene locus in three ethnic Asian populations. Pharmacogenetics 12, 437 – 450. | en_US |
dc.identifier.citedreference | Tishkoff, S. A. & Kidd, K. K. ( 2004 ) Implications of biogeography of human populations for ‘race’ and medicine. Nat Genet 36, S21 – S27. | en_US |
dc.identifier.citedreference | Vishwanathan, H., Edwin, D., Usha Rani, M. V. & Majumder, P. P. ( 2003 ) A survey of haplotype frequencies and linkage disequilibrium at the DRD2 locus in the Nilgiri hill tribes, South India. Curr Sci 84, 566 – 570. | en_US |
dc.identifier.citedreference | Weir, B. S. ( 1996 ) Genetic Data Analysis II. Sunderland, MA : Sinauer Associates. | en_US |
dc.identifier.citedreference | Willer, C. J., Scott, L. J., Bonnycastle, L. L., Jackson, A. U., Chines, P., Pruim, R., Bark, C. W., Tsai, Y.-Y., Pugh, E. W., Doheny, K. F., Kinnunen, L., Mohlke, K. L., Valle, T. T., Bergman, R. N., Tuomilehto, J., Collins, F. S. & Boehnke, M. ( 2006 ) Tag SNP selection for Finnish individuals based on the CEPH Utah HapMap database. Genet Epidemiol 30, 180 – 190. | en_US |
dc.identifier.citedreference | Xu, Z., Kaplan, N. L. & Taylor, J. A. ( 2007a ) Tag SNP selection for candidate gene association studies using HapMap and gene resequencing data. Eur J Hum Genet 15, 1063 – 1070. | en_US |
dc.identifier.citedreference | Xu, Z., Kaplan, N. L. & Taylor, J. A. ( 2007b ) TAGster: efficient selection of LD tag SNPs in single or multiple populations. Bioinformatics 23, 3254 – 3255. | en_US |
dc.identifier.citedreference | Yu, Z. & Schaid, D. J. ( 2007 ) Methods to impute missing genotypes for population data. Hum Genet 122, 495 – 504. | en_US |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |