IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v3y2004i1n16.html
   My bibliography  Save this article

Calculating the Statistical Significance of Changes in Pathway Activity From Gene Expression Data

Author

Listed:
  • Rahnenführer Jörg

    (Max-Planck-Institute for Informatics, Saarbrücken, Germany)

  • Domingues Francisco S

    (Max-Planck-Institute for Informatics, Saarbrücken, Germany)

  • Maydt Jochen

    (Max-Planck-Institute for Informatics, Saarbrücken, Germany)

  • Lengauer Thomas

    (Max-Planck-Institute for Informatics, Saarbrücken, Germany)

Abstract

We present a statistical approach to scoring changes in activity of metabolic pathways from gene expression data. The method identifies the biologically relevant pathways with corresponding statistical significance. Based on gene expression data alone, only local structures of genetic networks can be recovered. Instead of inferring such a network, we propose a hypothesis-based approach. We use given knowledge about biological networks to improve sensitivity and interpretability of findings from microarray experiments.Recently introduced methods test if members of predefined gene sets are enriched in a list of top-ranked genes in a microarray study. We improve this approach by defining scores that depend on all members of the gene set and that also take pairwise co-regulation of these genes into account. We calculate the significance of co-regulation of gene sets with a nonparametric permutation test. On two data sets the method is validated and its biological relevance is discussed. It turns out that useful measures for co-regulation of genes in a pathway can be identified adaptively.We refine our method in two aspects specific to pathways. First, to overcome the ambiguity of enzyme-to-gene mappings for a fixed pathway, we introduce algorithms for selecting the best fitting gene for a specific enzyme in a specific condition. In selected cases, functional assignment of genes to pathways is feasible. Second, the sensitivity of detecting relevant pathways is improved by integrating information about pathway topology. The distance of two enzymes is measured by the number of reactions needed to connect them, and enzyme pairs with a smaller distance receive a higher weight in the score calculation.

Suggested Citation

  • Rahnenführer Jörg & Domingues Francisco S & Maydt Jochen & Lengauer Thomas, 2004. "Calculating the Statistical Significance of Changes in Pathway Activity From Gene Expression Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-31, June.
  • Handle: RePEc:bpj:sagmbi:v:3:y:2004:i:1:n:16
    DOI: 10.2202/1544-6115.1055
    as

    Download full text from publisher

    File URL: https://doi.org/10.2202/1544-6115.1055
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.2202/1544-6115.1055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lingxiang Wu & Xiujie Chen & Denan Zhang & Wubing Zhang & Lei Liu & Hongzhe Ma & Jingbo Yang & Hongbo Xie & Bo Liu & Qing Jin, 2016. "IGSA: Individual Gene Sets Analysis, including Enrichment and Clustering," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-16, October.
    2. Dørum Guro & Snipen Lars & Solheim Margrete & Saebo Solve, 2011. "Smoothing Gene Expression Data with Network Information Improves Consistency of Regulated Genes," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-26, August.
    3. Shojaie Ali & Michailidis George, 2010. "Network Enrichment Analysis in Complex Experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-36, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:3:y:2004:i:1:n:16. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.