Robust meta‐analysis of biobank‐based genome‐wide association studies with unbalanced binary phenotypes
dc.contributor.author | Dey, Rounak | |
dc.contributor.author | Nielsen, Jonas B. | |
dc.contributor.author | Fritsche, Lars G. | |
dc.contributor.author | Zhou, Wei | |
dc.contributor.author | Zhu, Huanhuan | |
dc.contributor.author | Willer, Cristen J. | |
dc.contributor.author | Lee, Seunggeun | |
dc.date.accessioned | 2019-06-20T17:04:59Z | |
dc.date.available | WITHHELD_14_MONTHS | |
dc.date.available | 2019-06-20T17:04:59Z | |
dc.date.issued | 2019-07 | |
dc.identifier.citation | Dey, Rounak; Nielsen, Jonas B.; Fritsche, Lars G.; Zhou, Wei; Zhu, Huanhuan; Willer, Cristen J.; Lee, Seunggeun (2019). "Robust meta‐analysis of biobank‐based genome‐wide association studies with unbalanced binary phenotypes." Genetic Epidemiology 43(5): 462-476. | |
dc.identifier.issn | 0741-0395 | |
dc.identifier.issn | 1098-2272 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/149520 | |
dc.description.abstract | With the availability of large‐scale biobanks, genome‐wide scale phenome‐wide association studies are being instrumental in discovering novel genetic variants associated with clinical phenotypes. As increasing number of such association results from different biobanks become available, methods to meta‐analyse those association results is of great interest. Because the binary phenotypes in biobank‐based studies are mostly unbalanced in their case–control ratios, very few methods can provide well‐calibrated tests for associations. For example, traditional Z‐score‐based meta‐analysis often results in conservative or anticonservative Type I error rates in such unbalanced scenarios. We propose two meta‐analysis strategies that can efficiently combine association results from biobank‐based studies with such unbalanced phenotypes, using the saddlepoint approximation‐based score test method. Our first method involves sharing the overall genotype counts from each study, and the second method involves sharing an approximation of the distribution of the score test statistic from each study using cubic Hermite splines. We compare our proposed methods with a traditional Z‐score‐based meta‐analysis strategy using numerical simulations and real data applications, and demonstrate the superior performance of our proposed methods in terms of Type I error control. | |
dc.publisher | Wiley Periodicals, Inc. | |
dc.publisher | Prentice‐Hall | |
dc.subject.other | case–control studies | |
dc.subject.other | biobank | |
dc.subject.other | GWAS | |
dc.subject.other | meta‐analysis | |
dc.subject.other | saddlepoint approximation | |
dc.title | Robust meta‐analysis of biobank‐based genome‐wide association studies with unbalanced binary phenotypes | |
dc.type | Article | |
dc.rights.robots | IndexNoFollow | |
dc.subject.hlbsecondlevel | Biological Chemistry | |
dc.subject.hlbsecondlevel | Genetics | |
dc.subject.hlbsecondlevel | Molecular, Cellular and Developmental Biology | |
dc.subject.hlbtoplevel | Health Sciences | |
dc.subject.hlbtoplevel | Science | |
dc.description.peerreviewed | Peer Reviewed | |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/149520/1/gepi22197.pdf | |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/149520/2/gepi22197-sup-0002-SPA_Meta_SuppB.pdf | |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/149520/3/gepi22197_am.pdf | |
dc.identifier.doi | 10.1002/gepi.22197 | |
dc.identifier.source | Genetic Epidemiology | |
dc.identifier.citedreference | Krokstad, S., Langhammer, A., Hveem, K., Holmen, T., Midthjell, K., Stene, T., … Holmen, J. ( 2013 ). Cohort profile: The HUNT study, Norway. International Journal of Epidemiology, 42 ( 4 ), 968 – 977. https://doi.org/10.1093/ije/dys095 | |
dc.identifier.citedreference | UK Biobank. ( 2015 ). Genotyping and quality control of UK Biobank, a large‐scale, extensively phenotyped prospective resource. Retrieved from http://biobank.ctsu.ox.ac.uk/crystal/docs/genotyping_qc.pdf. | |
dc.identifier.citedreference | Stuart, P. E., Nair, R. P., Tsoi, L. C., Tejasvi, T., Das, S., Kang, H. M., … Elder, J. T. ( 2015 ). Genome‐wide association analysis of psoriatic arthritis and cutaneous psoriasis reveals differences in their genetic architecture. American Journal of Human Genetics, 97 ( 6 ), 816 – 836. https://doi.org/10.1016/j.ajhg.2015.10.019 | |
dc.identifier.citedreference | Press, W. H., Flannery., B. P., Teukolsky, S. A., & Vetterling, W. T. ( 1992 ). Numerical recipes in FORTRAN: The art of scientific computing ( 2nd ed.). Cambridge, England: Cambridge University Press. | |
dc.identifier.citedreference | Ma, C., Blackwell, T., Boehnke, M., Scott, L. J., & investigators, G. D. ( 2013 ). Recommended joint and meta‐analysis strategies for case–control association testing of single low‐count variants. Genetic Epidemiology, 37 ( 6 ), 539 – 550. https://doi.org/10.1002/gepi.21742 | |
dc.identifier.citedreference | Barndorff‐Nielsen, O. E. ( 1990 ). Approximate interval probabilities. Journal of the Royal Statistical Society, 52 ( 3 ), 485 – 496. | |
dc.identifier.citedreference | Brent, R. P. ( 1973 ). Algorithms for minimization without derivatives. Englewood Cliffs, NJ: Prentice‐Hall. | |
dc.identifier.citedreference | Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., … Marchini, J. ( 2017 ). Genome‐wide genetic data on ~500,000 UK Biobank participants. bioRxiv. 166298. https://www.nature.com/articles/s41586‐018‐0579‐z | |
dc.identifier.citedreference | Cooper, H. M., Hedges, L. V., & Valentine, J. C. ( 2009 ). The handbook of research synthesis and meta‐analysis ( 2nd ed.). New York, NY: Russell Sage Foundation. | |
dc.identifier.citedreference | Daniels, H. E. ( 1954 ). Saddlepoint approximations in statistics. Annals of Mathematical Statistics, 25 ( 4 ), 631 – 650. https://doi.org/10.1214/aoms/1177728652 | |
dc.identifier.citedreference | Das, S., Forer, L., Schönherr, S., Sidore, C., Locke, A. E., Kwong, A., … Fuchsberger, C. ( 2016 ). Next‐generation genotype imputation service and methods. Nature Genetics, 48 ( 10 ), 1284 – 1287. https://doi.org/10.1038/ng.3656 | |
dc.identifier.citedreference | Dey, R., Schmidt, E. M., Abecasis, G. R., & Lee, S. ( 2017 ). A fast and accurate algorithm to test for binary phenotypes and its application to PheWAS. American Journal of Human Genetics, 101 ( 1 ), 37 – 49. https://doi.org/10.1016/j.ajhg.2017.05.014 | |
dc.identifier.citedreference | Evangelou, E., & Ioannidis, J. P. A. ( 2013 ). Meta‐analysis methods for genome‐wide association studies and beyond. Nature Reviews Genetics, 14 ( 6 ), 379 – 389. https://doi.org/10.1038/nrg3472 | |
dc.identifier.citedreference | Firth, D. ( 1993 ). Bias reduction of maximum likelihood estimates. Biometrika, 80 ( 1 ), 27 – 38. https://doi.org/10.1093/biomet/80.1.27 | |
dc.identifier.citedreference | Han, B., & Eskin, E. ( 2011 ). Random‐effects model aimed at discovering associations in meta‐analysis of genome‐wide association studies. American Journal of Human Genetics, 88 ( 5 ), 586 – 598. https://doi.org/10.1016/j.ajhg.2011.04.014 | |
dc.identifier.citedreference | Hebbring, S. J. ( 2014 ). The challenges, advantages and future of phenome‐wide association studies. Immunology, 141 ( 2 ), 157 – 165. https://doi.org/10.1111/imm.12195 | |
dc.identifier.citedreference | Howie, B. N., Donnelly, P., & Marchini, J. ( 2009 ). A flexible and accurate genotype imputation method for the next generation of genome‐wide association studies. PLOS Genetics, 5 ( 6 ), e1000529. https://doi.org/10.1371/journal.pgen.1000529 | |
dc.identifier.citedreference | Zhou, W., Nielsen, J. B., Fritsche, L. G., Dey, R., Elvestad, M. B., Wolford, B. N., … Lee, S. ( 2018 ). Efficiently controlling for case–control imbalance and sample relatedness in large‐scale genetic association studies. Nature Genetics, 50, 1335 – 1341. https://doi.org/10.1101/212357 | |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.