Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
Genome Biol
2009 Jan 01;104:R44. doi: 10.1186/gb-2009-10-4-r44.
Show Gene links
Show Anatomy links
Choosing the right path: enhancement of biologically relevant sets of genes or proteins using pathway structure.
Thomas R
,
Gohlke JM
,
Stopper GF
,
Parham FM
,
Portier CJ
.
???displayArticle.abstract???
A method is proposed that finds enriched pathways relevant to a studied condition using the measured molecular data and also the structural information of the pathway viewed as a network of nodes and edges. Tests are performed using simulated data and genomic data sets and the method is compared to two existing approaches. The analysis provided demonstrates the method proposed is very competitive with the current approaches and also provides biologically relevant results.
Figure 1. Empirical distribution function of number of pathways associated with genes at given distances from terminal nodes. Empirical cumulative distribution function of the number of pathways that are associated with genes that have gene products located at a given distance, d (= 0, 1, 2, 3, 4), from a terminal node of the pathway network. Gene products that are at a distance d = 0 are the terminal gene products. The data used were those of all the genes associated with human signaling pathways in the KEGG pathway database [44].
Figure 2. Receiver-operator characteristic and positive predictive power versus negative predictive power plots for lung cancer data. (a) Sensitivity versus '1 - specificity' of enriched pathways that are predictive of survival from lung cancer for four methods: SEPEA_NT1, SEPEA_NT1*, GSEA and maxmean. SEPEA_NT1* is the same analysis as SEPEA_NT1 except that the pathway network information was not used. (b) Positive predictive power (ppp) versus negative predictive power (npp) for the same data and using the same methods of analysis as in (a).
Figure 3. Schematic of networks used to generate simulated data. Illustrative schematic of the two pathways used to generate the simulated data. (a) The Linear network of 30 nodes/gene products, each of which is associated with one gene. The pair of squiggly lines across some arrows is used to indicate that there are more nodes that are not shown. (b) The Erbb signaling pathway from the KEGG pathway database [44]. The expressions of the genes associated with the nodes circled in red are correlated with each other and are the genes that were affected by the treatment.
Al-Shahrour,
From genes to functional classes in the study of biological systems.
2007, Pubmed
Al-Shahrour,
From genes to functional classes in the study of biological systems.
2007,
Pubmed
Al-Shahrour,
FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes.
2004,
Pubmed
Barrett,
NCBI GEO: mining tens of millions of expression profiles--database and tools update.
2007,
Pubmed
Barry,
Significance analysis of functional categories in gene expression studies: a structured permutation approach.
2005,
Pubmed
Behmoaram,
Early stage cancer cell invasion: signaling, biomarkers and therapeutic targeting.
2008,
Pubmed
Beissbarth,
GOstat: find statistically overrepresented Gene Ontologies within a group of genes.
2004,
Pubmed
Chatzizacharias,
Focal adhesion kinase: a promising target for anticancer therapy.
2007,
Pubmed
Chen,
Significance analysis of groups of genes in expression profiling studies.
2007,
Pubmed
Dinu,
Improving gene set analysis of microarray data by SAM-GS.
2007,
Pubmed
Draghici,
A systems biology approach for pathway level analysis.
2007,
Pubmed
Eisen,
Cluster analysis and display of genome-wide expression patterns.
1998,
Pubmed
Elia,
Sonic hedgehog promotes proliferation and differentiation of adult muscle cells: Involvement of MAPK/ERK and PI3K/Akt pathways.
2007,
Pubmed
Goeman,
Testing association of a pathway with survival using gene expression data.
2005,
Pubmed
Goeman,
Analyzing gene expression data in terms of gene sets: methodological issues.
2007,
Pubmed
Goeman,
A global test for groups of genes: testing association with a clinical outcome.
2004,
Pubmed
Grosu,
Pathway Processor: a tool for integrating whole-genome expression results into metabolic networks.
2002,
Pubmed
Hanisch,
Co-clustering of biological networks and gene expression data.
2002,
Pubmed
Herrero,
GEPAS: A web-based resource for microarray gene expression data analysis.
2003,
Pubmed
Jiang,
Extensions to gene set enrichment.
2007,
Pubmed
Kanehisa,
From genomics to chemical genomics: new developments in KEGG.
2006,
Pubmed
Khatri,
Onto-Tools: an ensemble of web-accessible, ontology-based tools for the functional design and interpretation of high-throughput gene expression experiments.
2004,
Pubmed
Kim,
GAzer: gene set analyzer.
2007,
Pubmed
Kim,
PAGE: parametric analysis of gene set enrichment.
2005,
Pubmed
Liu,
Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.
2007,
Pubmed
Liu,
Comparative evaluation of gene-set analysis methods.
2007,
Pubmed
Liu,
Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models.
2008,
Pubmed
Maglietta,
Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data.
2007,
Pubmed
Maslov,
Specificity and stability in topology of protein networks.
2002,
Pubmed
Mootha,
PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.
2003,
Pubmed
Nam,
Gene-set approach for expression pattern analysis.
2008,
Pubmed
Nettleton,
Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis.
2008,
Pubmed
Osawa,
Sonic hedgehog stimulates the proliferation of rat gastric mucosal cells through ERK activation by elevating intracellular calcium concentration.
2006,
Pubmed
Pan,
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis.
2003,
Pubmed
Pandey,
Pathway Miner: extracting gene association networks from molecular pathways for predicting the biological significance of gene expression microarray data.
2004,
Pubmed
Raponi,
Gene expression signatures for predicting prognosis of squamous cell and adenocarcinomas of the lung.
2006,
Pubmed
Raychaudhuri,
Principal components analysis to summarize microarray experiments: application to sporulation time series.
2000,
Pubmed
Schaeffer,
Mitogen-activated protein kinases: specific messages from ubiquitous messengers.
1999,
Pubmed
Segal,
A module map showing conditional activity of expression modules in cancer.
2004,
Pubmed
Shah,
CLENCH: a program for calculating Cluster ENriCHment using the Gene Ontology.
2004,
Pubmed
Subramanian,
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.
2005,
Pubmed
Tian,
Discovering statistically significant pathways in expression profiling studies.
2005,
Pubmed
Tomfohr,
Pathway level analysis of gene expression using singular value decomposition.
2005,
Pubmed
Törönen,
Analysis of gene expression data using self-organizing maps.
1999,
Pubmed
Vert,
Extracting active pathways from gene expression data.
2003,
Pubmed
Wang,
An integrated approach for the analysis of biological pathways using mixed models.
2008,
Pubmed
Wei,
Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model.
2008,
Pubmed
Zeeberg,
GoMiner: a resource for biological interpretation of genomic and proteomic data.
2003,
Pubmed
Zhong,
ChipInfo: Software for extracting gene annotation and gene ontology information for microarray analysis.
2003,
Pubmed