IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1005379.html
   My bibliography  Save this article

Data-driven reverse engineering of signaling pathways using ensembles of dynamic models

Author

Listed:
  • David Henriques
  • Alejandro F Villaverde
  • Miguel Rocha
  • Julio Saez-Rodriguez
  • Julio R Banga

Abstract

Despite significant efforts and remarkable progress, the inference of signaling networks from experimental data remains very challenging. The problem is particularly difficult when the objective is to obtain a dynamic model capable of predicting the effect of novel perturbations not considered during model training. The problem is ill-posed due to the nonlinear nature of these systems, the fact that only a fraction of the involved proteins and their post-translational modifications can be measured, and limitations on the technologies used for growing cells in vitro, perturbing them, and measuring their variations. As a consequence, there is a pervasive lack of identifiability. To overcome these issues, we present a methodology called SELDOM (enSEmbLe of Dynamic lOgic-based Models), which builds an ensemble of logic-based dynamic models, trains them to experimental data, and combines their individual simulations into an ensemble prediction. It also includes a model reduction step to prune spurious interactions and mitigate overfitting. SELDOM is a data-driven method, in the sense that it does not require any prior knowledge of the system: the interaction networks that act as scaffolds for the dynamic models are inferred from data using mutual information. We have tested SELDOM on a number of experimental and in silico signal transduction case-studies, including the recent HPN-DREAM breast cancer challenge. We found that its performance is highly competitive compared to state-of-the-art methods for the purpose of recovering network topology. More importantly, the utility of SELDOM goes beyond basic network inference (i.e. uncovering static interaction networks): it builds dynamic (based on ordinary differential equation) models, which can be used for mechanistic interpretations and reliable dynamic predictions in new experimental conditions (i.e. not used in the training). For this task, SELDOM’s ensemble prediction is not only consistently better than predictions from individual models, but also often outperforms the state of the art represented by the methods used in the HPN-DREAM challenge.Author summary: Signaling pathways play a key role in complex diseases such as cancer, for which the development of novel therapies is a difficult, expensive and laborious task. Computational models that can predict the effect of a new combination of drugs without having to test it experimentally can help in accelerating this process. In particular, network-based dynamic models of these pathways hold promise to both understand and predict the effect of therapeutics. However, their use is currently hampered by limitations in our knowledge of the underlying biochemistry, as well as in the experimental and computational technologies used for calibrating the models. Thus, the results from such models need to be carefully interpreted and used in order to avoid biased predictions. Here we present a procedure that deals with this uncertainty by using experimental data to build an ensemble of dynamic models. The method incorporates steps to reduce overfitting and maximize predictive capability. We find that by combining the outputs of individual models in an ensemble it is possible to obtain a more robust prediction. We report results obtained with this method, which we call SELDOM (enSEmbLe of Dynamic lOgic-based Models), showing that it improves the predictions previously reported for several challenging problems.

Suggested Citation

  • David Henriques & Alejandro F Villaverde & Miguel Rocha & Julio Saez-Rodriguez & Julio R Banga, 2017. "Data-driven reverse engineering of signaling pathways using ensembles of dynamic models," PLOS Computational Biology, Public Library of Science, vol. 13(2), pages 1-25, February.
  • Handle: RePEc:plo:pcbi00:1005379
    DOI: 10.1371/journal.pcbi.1005379
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005379
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1005379&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1005379?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jeremiah J Faith & Boris Hayete & Joshua T Thaden & Ilaria Mogno & Jamey Wierzbowski & Guillaume Cottarel & Simon Kasif & James J Collins & Timothy S Gardner, 2007. "Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-13, January.
    2. Samuel Bandara & Johannes P Schlöder & Roland Eils & Hans Georg Bock & Tobias Meyer, 2009. "Optimal Experimental Design for Parameter Estimation of a Cell Signaling Model," PLOS Computational Biology, Public Library of Science, vol. 5(11), pages 1-12, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gabriele Scheler, 2013. "Transfer Functions for Protein Signal Transduction: Application to a Model of Striatal Neural Plasticity," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-13, February.
    2. Lulu Shang & Jennifer A Smith & Xiang Zhou, 2020. "Leveraging gene co-expression patterns to infer trait-relevant tissues in genome-wide association studies," PLOS Genetics, Public Library of Science, vol. 16(4), pages 1-30, April.
    3. Hossein Zare & Mostafa Kaveh & Arkady Khodursky, 2011. "Inferring a Transcriptional Regulatory Network from Gene Expression Data Using Nonlinear Manifold Embedding," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-7, August.
    4. Diambra, L., 2011. "Coarse-grain reconstruction of genetic networks from expression levels," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(11), pages 2198-2207.
    5. Marco Grimaldi & Roberto Visintainer & Giuseppe Jurman, 2011. "RegnANN: Reverse Engineering Gene Networks Using Artificial Neural Networks," PLOS ONE, Public Library of Science, vol. 6(12), pages 1-19, December.
    6. Ruonan Wu & Michelle R. Davison & William C. Nelson & Montana L. Smith & Mary S. Lipton & Janet K. Jansson & Ryan S. McClure & Jason E. McDermott & Kirsten S. Hofmockel, 2023. "Hi-C metagenome sequencing reveals soil phage–host interactions," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    7. Daniel Lobo & Michael Levin, 2015. "Inferring Regulatory Networks from Experimental Morphological Phenotypes: A Computational Method Reverse-Engineers Planarian Regeneration," PLOS Computational Biology, Public Library of Science, vol. 11(6), pages 1-28, June.
    8. repec:jss:jstsof:37:i01 is not listed on IDEAS
    9. Joeri Ruyssinck & Vân Anh Huynh-Thu & Pierre Geurts & Tom Dhaene & Piet Demeester & Yvan Saeys, 2014. "NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-13, March.
    10. Tom Wilderjans & Dirk Depril & Iven Van Mechelen, 2013. "Additive Biclustering: A Comparison of One New and Two Existing ALS Algorithms," Journal of Classification, Springer;The Classification Society, vol. 30(1), pages 56-74, April.
    11. Shuhei Kimura & Masanao Sato & Mariko Okada-Hatakeyama, 2013. "Inference of Vohradský's Models of Genetic Networks by Solving Two-Dimensional Function Optimization Problems," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-11, December.
    12. Xiaomeng Zhang & Bin Shao & Yangle Wu & Ouyang Qi, 2013. "A Reverse Engineering Approach to Optimize Experiments for the Construction of Biological Regulatory Networks," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-9, September.
    13. Takanori Hasegawa & Rui Yamaguchi & Masao Nagasaki & Satoru Miyano & Seiya Imoto, 2014. "Inference of Gene Regulatory Networks Incorporating Multi-Source Biological Knowledge via a State Space Model with L1 Regularization," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-19, August.
    14. Kannan Venkateshan & Tegner Jesper, 2016. "Adaptive input data transformation for improved network reconstruction with information theoretic algorithms," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(6), pages 507-520, December.
    15. Fei Liu & Shao-Wu Zhang & Wei-Feng Guo & Ze-Gang Wei & Luonan Chen, 2016. "Inference of Gene Regulatory Network Based on Local Bayesian Networks," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-17, August.
    16. Hirose, Kei & Fujisawa, Hironori & Sese, Jun, 2017. "Robust sparse Gaussian graphical modeling," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 172-190.
    17. Benafsh Husain & F Alex Feltus, 2019. "EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-15, August.
    18. Alejandro F Villaverde & Antonio Barreiro & Antonis Papachristodoulou, 2016. "Structural Identifiability of Dynamic Systems Biology Models," PLOS Computational Biology, Public Library of Science, vol. 12(10), pages 1-22, October.
    19. Zhen Yang & Yen‐Yi Ho, 2022. "Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data," Biometrics, The International Biometric Society, vol. 78(2), pages 766-776, June.
    20. Juliane Liepe & Sarah Filippi & Michał Komorowski & Michael P H Stumpf, 2013. "Maximizing the Information Content of Experiments in Systems Biology," PLOS Computational Biology, Public Library of Science, vol. 9(1), pages 1-13, January.
    21. Mingyi Wang & Jerome Verdier & Vagner A Benedito & Yuhong Tang & Jeremy D Murray & Yinbing Ge & Jörg D Becker & Helena Carvalho & Christian Rogers & Michael Udvardi & Ji He, 2013. "LegumeGRN: A Gene Regulatory Network Prediction Server for Functional and Comparative Studies," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-7, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1005379. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.