Importance of Different Types of Prior Knowledge in Selecting Genome‐Wide Findings for Follow‐Up

Minelli, Cosetta; De Grandi, Alessandro; Weichenberger, Christian X.; Gögele, Martin; Modenese, Mirko; Attia, John; Barrett, Jennifer H.; Boehnke, Michael; Borsani, Giuseppe; Casari, Giorgio; Fox, Caroline S.; Freina, Thomas; Hicks, Andrew A.; Marroni, Fabio; Parmigiani, Giovanni; Pastore, Andrea; Pattaro, Cristian; Pfeufer, Arne; Ruggeri, Fabrizio; Schwienbacher, Christine; Taliun, Daniel; Pramstaller, Peter P.; Domingues, Francisco S.; Thompson, John R.

Importance of Different Types of Prior Knowledge in Selecting Genome‐Wide Findings for Follow‐Up

dc.contributor.author	Minelli, Cosetta	en_US
dc.contributor.author	De Grandi, Alessandro	en_US
dc.contributor.author	Weichenberger, Christian X.	en_US
dc.contributor.author	Gögele, Martin	en_US
dc.contributor.author	Modenese, Mirko	en_US
dc.contributor.author	Attia, John	en_US
dc.contributor.author	Barrett, Jennifer H.	en_US
dc.contributor.author	Boehnke, Michael	en_US
dc.contributor.author	Borsani, Giuseppe	en_US
dc.contributor.author	Casari, Giorgio	en_US
dc.contributor.author	Fox, Caroline S.	en_US
dc.contributor.author	Freina, Thomas	en_US
dc.contributor.author	Hicks, Andrew A.	en_US
dc.contributor.author	Marroni, Fabio	en_US
dc.contributor.author	Parmigiani, Giovanni	en_US
dc.contributor.author	Pastore, Andrea	en_US
dc.contributor.author	Pattaro, Cristian	en_US
dc.contributor.author	Pfeufer, Arne	en_US
dc.contributor.author	Ruggeri, Fabrizio	en_US
dc.contributor.author	Schwienbacher, Christine	en_US
dc.contributor.author	Taliun, Daniel	en_US
dc.contributor.author	Pramstaller, Peter P.	en_US
dc.contributor.author	Domingues, Francisco S.	en_US
dc.contributor.author	Thompson, John R.	en_US
dc.date.accessioned	2013-02-12T19:00:27Z
dc.date.available	2014-04-02T15:08:08Z	en_US
dc.date.issued	2013-02	en_US
dc.identifier.citation	Minelli, Cosetta; De Grandi, Alessandro; Weichenberger, Christian X.; Gögele, Martin ; Modenese, Mirko; Attia, John; Barrett, Jennifer H.; Boehnke, Michael; Borsani, Giuseppe; Casari, Giorgio; Fox, Caroline S.; Freina, Thomas; Hicks, Andrew A.; Marroni, Fabio; Parmigiani, Giovanni; Pastore, Andrea; Pattaro, Cristian; Pfeufer, Arne; Ruggeri, Fabrizio; Schwienbacher, Christine; Taliun, Daniel; Pramstaller, Peter P.; Domingues, Francisco S.; Thompson, John R. (2013). "Importance of Different Types of Prior Knowledge in Selecting Genomeâ Wide Findings for Followâ Up." Genetic Epidemiology 37(2): 205-213. <http://hdl.handle.net/2027.42/96262>	en_US
dc.identifier.issn	0741-0395	en_US
dc.identifier.issn	1098-2272	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/96262
dc.description.abstract	Biological plausibility and other prior information could help select genome‐wide association ( GWA ) findings for further follow‐up, but there is no consensus on which types of knowledge should be considered or how to weight them. We used experts’ opinions and empirical evidence to estimate the relative importance of 15 types of information at the single‐nucleotide polymorphism ( SNP ) and gene levels. Opinions were elicited from 10 experts using a two‐round Delphi survey. Empirical evidence was obtained by comparing the frequency of each type of characteristic in SNP s established as being associated with seven disease traits through GWA meta‐analysis and independent replication, with the corresponding frequency in a randomly selected set of SNP s. SNP and gene characteristics were retrieved using a specially developed bioinformatics tool. Both the expert and the empirical evidence rated previous association in a meta‐analysis or more than one study as conferring the highest relative probability of true association, whereas previous association in a single study ranked much lower. High relative probabilities were also observed for location in a functional protein domain, although location in a region evolutionarily conserved in vertebrates was ranked high by the data but not by the experts. Our empirical evidence did not support the importance attributed by the experts to whether the gene encodes a protein in a pathway or shows interactions relevant to the trait. Our findings provide insight into the selection and weighting of different types of knowledge in SNP or gene prioritization, and point to areas requiring further research.	en_US
dc.publisher	Wiley Periodicals, Inc.	en_US
dc.publisher	Jessica Kingsley	en_US
dc.subject.other	Bioinformatics Databases	en_US
dc.subject.other	Gene Prioritization	en_US
dc.subject.other	Genome‐Wide Association Studies	en_US
dc.title	Importance of Different Types of Prior Knowledge in Selecting Genome‐Wide Findings for Follow‐Up	en_US
dc.type	Article	en_US
dc.rights.robots	IndexNoFollow	en_US
dc.subject.hlbsecondlevel	Biological Chemistry	en_US
dc.subject.hlbsecondlevel	Genetics	en_US
dc.subject.hlbsecondlevel	Molecular, Cellular and Developmental Biology	en_US
dc.subject.hlbtoplevel	Health Sciences	en_US
dc.subject.hlbtoplevel	Science	en_US
dc.description.peerreviewed	Peer Reviewed	en_US
dc.identifier.pmid	23307621	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/96262/1/gepi21705.pdf
dc.identifier.doi	10.1002/gepi.21705	en_US
dc.identifier.source	Genetic Epidemiology	en_US
dc.identifier.citedreference	Robertson G, Bilenky M, Lin K, He A, Yuen W, Dagpinar M, Varhol R, Teague K, Griffith OL, Zhang X and others. 2006. cisRED: a database system for genome‐scale computational discovery of regulatory elements. Nucleic Acids Res 34: D68 – D73.	en_US
dc.identifier.citedreference	Aranda B, Achuthan P, Alam‐Faruque Y, Armean I, Bridge A, Derow C, Feuermann M, Ghanbarian AT, Kerrien S, Khadake J and others. 2010. The IntAct molecular interaction database in 2010. Nucleic Acids Res 38: D525 – D531.	en_US
dc.identifier.citedreference	Blake JA, Bult CJ, Kadin JA, Richardson JE, Eppig JT, Mouse Genome Database Group. 2011. The Mouse Genome Database ( MGD ): premier model organism resource for mammalian genomics and genetics. Nucleic Acids Res 39: D842 – D848.	en_US
dc.identifier.citedreference	Cantor RM, Lange K, Sinsheimer JS. 2010. Prioritizing GWAS results: a review of statistical methods and recommendations for their application. Am J Hum Genet 86: 6 – 22.	en_US
dc.identifier.citedreference	Chen J, Aronow BJ, Jegga AG. 2009. Disease candidate gene identification and prioritization using protein interaction networks. BMC Bioinformatics 10: 73.	en_US
dc.identifier.citedreference	Chen Y, Wang W, Zhou Y, Shields R, Chanda SK, Elston RC, Li J. 2011. In silico gene prioritization by integrating multiple data sources. PLoS One 6: e21137.	en_US
dc.identifier.citedreference	Coronary Artery Disease (C4D) Genetics Consortium. 2011. A genome‐wide association study in E uropeans and S outh A sians identifies five new loci for coronary artery disease. Nat Genet 43: 339 – 344.	en_US
dc.identifier.citedreference	Davies HT, Crombie IK, Tavakoli M. 1998. When can odds ratios mislead? BMJ 316: 989 – 991.	en_US
dc.identifier.citedreference	Elbers CC, van Eijk KR, Franke L, Mulder F, van der Schouw YT, Wijmenga C, Onland ‐Moret NC. 2009. Using genome‐wide pathway analysis to unravel the etiology of complex diseases. Genet Epidemiol 33: 419 – 431.	en_US
dc.identifier.citedreference	Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K and others. 2010. The Pfam protein families database. Nucleic Acids Res 38: D211 – D222.	en_US
dc.identifier.citedreference	Fleiss JL. 1993. The statistical basis of meta‐analysis. Stat Methods Med Res 2: 121 – 145.	en_US
dc.identifier.citedreference	Flicek P, Amode MR, Barrell D, Beal K, Brent S, Chen Y, Clapham P, Coates G, Fairley S, Fitzgerald S and others. 2011. Ensembl 2011. Nucleic Acids Res 39: D800 – D806.	en_US
dc.identifier.citedreference	Franke A, McGovern DP, Barrett JC, Wang K, Radford ‐Smith GL, Ahmad T, Lees CW, Balschun T, Lee J, Roberts R and others. 2010. Genome‐wide meta‐analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat Genet 42: 1118 – 1125.	en_US
dc.identifier.citedreference	Gögele M, Minelli C, Thakkinstian A, Yurkiewich A, Pattaro C, Pramstaller PP, Little J, Attia J, Thompson JR. 2012. Methods for meta‐analyses of genome‐wide association studies: critical assessment of empirical evidence. Am J Epidemiol 175: 739 – 749.	en_US
dc.identifier.citedreference	Greene CS, Penrod NM, Williams SM, Moore JH. 2009. Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One 4: e5639.	en_US
dc.identifier.citedreference	Hardouin SN, Nagy A. 2000. Mouse models for human disease. Clin Genet 57: 237 – 244.	en_US
dc.identifier.citedreference	Higgins JP, Thompson SG, Deeks JJ, Altman DG. 2003. Measuring inconsistency in meta‐analyses. BMJ 327: 557 – 560.	en_US
dc.identifier.citedreference	Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA. 2009. Potential etiologic and functional implications of genome‐wide association loci for human diseases and traits. Proc Natl Acad Sci USA 106: 9362 – 9367.	en_US
dc.identifier.citedreference	Ioannidis JP. 2007. Non‐replication and inconsistency in the genome‐wide association setting. Hum Hered 64: 203 – 213.	en_US
dc.identifier.citedreference	John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS. 2004. Human Micro RNA targets. PLoS Biol 2: e363.	en_US
dc.identifier.citedreference	Kanehisa M, Goto S. 2000. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28: 27 – 30.	en_US
dc.identifier.citedreference	Kirouac DC, Saez‐Rodriguez J, Swantek J, Burke JM, Lauffenburger DA, Sorger PK. 2012. Creating and analyzing pathway and protein interaction compendia for modelling signal transduction networks. BMC Syst Biol 6: 29.	en_US
dc.identifier.citedreference	Köttgen A, Pattaro C, Boger CA, Fuchsberger C, Olden M, Glazer NL, Parsa A, Gao X, Yang Q, Smith AV and others. 2010. New loci associated with kidney function and chronic kidney disease. Nat Genet 42: 376 – 384.	en_US
dc.identifier.citedreference	Kraft P, Zeggini E, Ioannidis JP. 2009. Replication in genome‐wide association studies. Stat Sci 24: 561 – 573.	en_US
dc.identifier.citedreference	Levenstien MA, Klein RJ. 2011. Predicting functionally important SNP classes based on negative selection. BMC Bioinformatics 12: 26.	en_US
dc.identifier.citedreference	Liu YJ, Papasian CJ, Liu JF, Hamilton J, Deng HW. 2008. Is replication the gold standard for validating genome‐wide association findings? PLoS One 3: e4037.	en_US
dc.identifier.citedreference	Mells GF, Floyd JA, Morley KI, Cordell HJ, Franklin CS, Shin SY, Heneghan MA, Neuberger JM, Donaldson PT, Day DB and others. 2011. Genome‐wide association study identifies 12 new susceptibility loci for primary biliary cirrhosis. Nat Genet 43: 329 – 332.	en_US
dc.identifier.citedreference	Moreau Y, Tranchevent LC. 2012. Computational tools for prioritizing candidate genes: boosting disease gene discovery. Nat Rev Genet 13: 523 – 536.	en_US
dc.identifier.citedreference	Parikh H, Lyssenko V, Groop LC. 2009. Prioritizing genes for follow‐up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus. BMC Med Genomics 2: 72.	en_US
dc.identifier.citedreference	Rosenthal N, Brown S. 2007. The mouse ascending: perspectives for human‐disease models. Nat Cell Biol 9: 993 – 999.	en_US
dc.identifier.citedreference	Saccone SF, Saccone NL, Swan GE, Madden PA, Goate AM, Rice JP, Bierut LJ. 2008. Systematic biological prioritization after a genome‐wide association study: an application to nicotine dependence. Bioinformatics 24: 1805 – 1811.	en_US
dc.identifier.citedreference	Schunkert H, Konig IR, Kathiresan S, Reilly MP, Assimes TL, Holm H, Preuss M, Stewart AF, Barbalic M, Gieger C and others. 2011. Large‐scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat Genet 43: 333 – 338.	en_US
dc.identifier.citedreference	Sookoian S, Gianotti TF, Schuman M, Pirola CJ. 2009. Gene prioritization based on biological plausibility over genome wide association studies renders new loci associated with type 2 diabetes. Genet Med 11: 338 – 343.	en_US
dc.identifier.citedreference	Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, Lango Allen H, Lindgren CM, Luan J, M ägi R and others. 2010. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet 42: 937 – 948.	en_US
dc.identifier.citedreference	Stabenau A, McVicker G, Melsopp C, Proctor G, Clamp M, Birney E. 2004. The E nsembl core software libraries. Genome Res 14: 929 – 933.	en_US
dc.identifier.citedreference	Stahl EA, Raychaudhuri S, Remmers EF, Xie G, Eyre S, Thomson BP, Li Y, Kurreeman FA, Zhernakova A, Hinks A and others. 2010. Genome‐wide association study meta‐analysis identifies seven new rheumatoid arthritis risk loci. Nat Genet 42: 508 – 514.	en_US
dc.identifier.citedreference	Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G and others. 2004. A gene atlas of the mouse and human protein‐encoding transcriptomes. Proc Natl Acad Sci USA 101: 6062 – 6067.	en_US
dc.identifier.citedreference	Thomas DC, Conti DV, Baurley J, Nijhout F, Reed M, Ulrich CM. 2009. Use of pathway information in molecular epidemiology. Hum Genomics 4: 21 – 42.	en_US
dc.identifier.citedreference	Thompson JR, Gögele M, Weichenberger CX, Modenese M, Attia J, Barrett JH, Boehnke M, De Grandi A, Domingues FS, Hicks AA and others. 2012. A Bayesian method for calculating the probability of replication by combining SNP information and genome‐wide data. Genet Epidemiol 37: XX – XX.	en_US
dc.identifier.citedreference	Visel A, Minovitsky S, Dubchak I, Pennacchio LA. 2007. VISTA enhancer browser—a database of tissue‐specific human enhancers. Nucleic Acids Res 35: D88 – D92.	en_US
dc.identifier.citedreference	Voight BF, Scott LJ, Steinthorsdottir V, Morris AP, Dina C, Welch RP, Zeggini E, Huth C, Aulchenko YS, Thorleifsson G and others. 2010. Twelve type 2 diabetes susceptibility loci identified through large‐scale association analysis. Nat Genet 42: 579 – 589.	en_US
dc.identifier.citedreference	Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, Hodge CL, Haase J, Janes J, Huss JW 3rd and others. 2009. Bio GPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome Biol 10: R130.	en_US
dc.identifier.citedreference	Yu W, Gwinn M, Clyne M, Yesupriya A, Khoury MJ. 2008. A navigator for human genome epidemiology. Nat Genet 40: 124 – 125.	en_US
dc.identifier.citedreference	Zhong H, Yang X, Kaplan LM, Molony C, Schadt EE. 2010. Integrating pathway analysis and genetics of gene expression for genome‐wide association studies. Am J Hum Genet 86: 581 – 591.	en_US
dc.identifier.citedreference	Adler M, Ziglio E. 1996. Gazing into the Oracle: The Delphi Method and Its Application to Social Policy and Public Health. London: Jessica Kingsley.	en_US
dc.identifier.citedreference	Aerts S, Lambrechts D, Maity S, Van Loo P, Coessens B, De Smet F, Tranchevent LC, De Moor B, Marynen P, Hassan B and others. 2006. Gene prioritization through genomic data fusion. Nat Biotechnol 24: 537 – 544.	en_US
dc.identifier.citedreference	Akins RB, Tolson H, Cole BR. 2005. Stability of response characteristics of a Delphi panel: application of bootstrap data expansion. BMC Med Res Methodol 5: 37.	en_US
dc.owningcollname	Interdisciplinary and Peer-Reviewed

Files in this item

Name:: gepi21705.pdf
Size:: 671.9KB
Format:: PDF

View/Open

Interdisciplinary and Peer-Reviewed

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.