Show simple item record

Appearance frequency modulated gene set enrichment testing

dc.contributor.authorMa, Jun
dc.contributor.authorSartor, Maureen A
dc.contributor.authorJagadish, HV
dc.date.accessioned2015-08-07T17:29:06Z
dc.date.available2015-08-07T17:29:06Z
dc.date.issued2011-03-20
dc.identifier.citationBMC Bioinformatics. 2011 Mar 20;12(1):81
dc.identifier.urihttps://hdl.handle.net/2027.42/112430en_US
dc.description.abstractAbstract Background Gene set enrichment testing has helped bridge the gap from an individual gene to a systems biology interpretation of microarray data. Although gene sets are defined a priori based on biological knowledge, current methods for gene set enrichment testing treat all genes equal. It is well-known that some genes, such as those responsible for housekeeping functions, appear in many pathways, whereas other genes are more specialized and play a unique role in a single pathway. Drawing inspiration from the field of information retrieval, we have developed and present here an approach to incorporate gene appearance frequency (in KEGG pathways) into two current methods, Gene Set Enrichment Analysis (GSEA) and logistic regression-based LRpath framework, to generate more reproducible and biologically meaningful results. Results Two breast cancer microarray datasets were analyzed to identify gene sets differentially expressed between histological grade 1 and 3 breast cancer. The correlation of Normalized Enrichment Scores (NES) between gene sets, generated by the original GSEA and GSEA with the appearance frequency of genes incorporated (GSEA-AF), was compared. GSEA-AF resulted in higher correlation between experiments and more overlapping top gene sets. Several cancer related gene sets achieved higher NES in GSEA-AF as well. The same datasets were also analyzed by LRpath and LRpath with the appearance frequency of genes incorporated (LRpath-AF). Two well-studied lung cancer datasets were also analyzed in the same manner to demonstrate the validity of the method, and similar results were obtained. Conclusions We introduce an alternative way to integrate KEGG PATHWAY information into gene set enrichment testing. The performance of GSEA and LRpath can be enhanced with the integration of appearance frequency of genes. We conclude that, generally, gene set analysis methods with the integration of information from KEGG PATHWAY performs better both statistically and biologically.
dc.titleAppearance frequency modulated gene set enrichment testing
dc.typeArticleen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/112430/1/12859_2010_Article_4457.pdf
dc.identifier.doi10.1186/1471-2105-12-81en_US
dc.language.rfc3066en
dc.rights.holderMa et al; licensee BioMed Central Ltd.
dc.date.updated2015-08-07T17:29:06Z
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.