IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003650.html
   My bibliography  Save this article

Model Selection in Systems Biology Depends on Experimental Design

Author

Listed:
  • Daniel Silk
  • Paul D W Kirk
  • Chris P Barnes
  • Tina Toni
  • Michael P H Stumpf

Abstract

Experimental design attempts to maximise the information available for modelling tasks. An optimal experiment allows the inferred models or parameters to be chosen with the highest expected degree of confidence. If the true system is faithfully reproduced by one of the models, the merit of this approach is clear - we simply wish to identify it and the true parameters with the most certainty. However, in the more realistic situation where all models are incorrect or incomplete, the interpretation of model selection outcomes and the role of experimental design needs to be examined more carefully. Using a novel experimental design and model selection framework for stochastic state-space models, we perform high-throughput in-silico analyses on families of gene regulatory cascade models, to show that the selected model can depend on the experiment performed. We observe that experimental design thus makes confidence a criterion for model choice, but that this does not necessarily correlate with a model's predictive power or correctness. Finally, in the special case of linear ordinary differential equation (ODE) models, we explore how wrong a model has to be before it influences the conclusions of a model selection analysis.Author Summary: Different models of the same process represent distinct hypotheses about reality. These can be decided between within the framework of model selection, where the evidence for each is given by their ability to reproduce a set of experimental data. Even if one of the models is correct, the chances of identifying it can be hindered by the quality of the data, both in terms of its signal to measurement error ratio and the intrinsic discriminatory potential of the experiment undertaken. This potential can be predicted in various ways, and maximising it is one aim of experimental design. In this work we present a computationally efficient method of experimental design for model selection. We exploit the efficiency to consider the implications of the realistic case where all models are more or less incorrect, showing that experiments can be chosen that, considered individually, lead to unequivocal support for opposed hypotheses.

Suggested Citation

  • Daniel Silk & Paul D W Kirk & Chris P Barnes & Tina Toni & Michael P H Stumpf, 2014. "Model Selection in Systems Biology Depends on Experimental Design," PLOS Computational Biology, Public Library of Science, vol. 10(6), pages 1-14, June.
  • Handle: RePEc:plo:pcbi00:1003650
    DOI: 10.1371/journal.pcbi.1003650
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003650
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003650&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003650?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mikael Sunnåker & Alberto Giovanni Busetto & Elina Numminen & Jukka Corander & Matthieu Foll & Christophe Dessimoz, 2013. "Approximate Bayesian Computation," PLOS Computational Biology, Public Library of Science, vol. 9(1), pages 1-10, January.
    2. Juliane Liepe & Sarah Filippi & Michał Komorowski & Michael P H Stumpf, 2013. "Maximizing the Information Content of Experiments in Systems Biology," PLOS Computational Biology, Public Library of Science, vol. 9(1), pages 1-13, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thembi Mdluli & Gregery T Buzzard & Ann E Rundell, 2015. "Efficient Optimization of Stimuli for Model-Based Design of Experiments to Resolve Dynamical Uncertainty," PLOS Computational Biology, Public Library of Science, vol. 11(9), pages 1-23, September.
    2. Gennady Gorin & John J. Vastola & Meichen Fang & Lior Pachter, 2022. "Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    3. Filip Melinscak & Dominik R Bach, 2020. "Computational optimization of associative learning experiments," PLOS Computational Biology, Public Library of Science, vol. 16(1), pages 1-23, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. George Karabatsos, 2023. "Approximate Bayesian computation using asymptotically normal point estimates," Computational Statistics, Springer, vol. 38(2), pages 531-568, June.
    2. David J Price & Alexandre Breuzé & Richard Dybowski & Piero Mastroeni & Olivier Restif, 2017. "An efficient moments-based inference method for within-host bacterial infection dynamics," PLOS Computational Biology, Public Library of Science, vol. 13(11), pages 1-27, November.
    3. Frederick Callaway & Antonio Rangel & Thomas L Griffiths, 2021. "Fixation patterns in simple choice reflect optimal information sampling," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-29, March.
    4. Aushev, Alexander & Pesonen, Henri & Heinonen, Markus & Corander, Jukka & Kaski, Samuel, 2022. "Likelihood-free inference with deep Gaussian processes," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    5. Andrew White & Malachi Tolman & Howard D Thames & Hubert Rodney Withers & Kathy A Mason & Mark K Transtrum, 2016. "The Limitations of Model-Based Experimental Design and Parameter Estimation in Sloppy Systems," PLOS Computational Biology, Public Library of Science, vol. 12(12), pages 1-26, December.
    6. Jonathan U Harrison & Ruth E Baker, 2020. "An automatic adaptive method to combine summary statistics in approximate Bayesian computation," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-21, August.
    7. Hazra, Indranil & Pandey, Mahesh D. & Manzana, Noldainerick, 2020. "Approximate Bayesian computation (ABC) method for estimating parameters of the gamma process using noisy data," Reliability Engineering and System Safety, Elsevier, vol. 198(C).
    8. Chapron, Guillaume & Wikenros, Camilla & Liberg, Olof & Wabakken, Petter & Flagstad, Øystein & Milleret, Cyril & Månsson, Johan & Svensson, Linn & Zimmermann, Barbara & Åkesson, Mikael & Sand, Håkan, 2016. "Estimating wolf (Canis lupus) population size from number of packs and an individual based model," Ecological Modelling, Elsevier, vol. 339(C), pages 33-44.
    9. Mathias Silva, 2023. "Parametric estimation of income distributions using grouped data: an Approximate Bayesian Computation approach [Working Papers / Documents de travail]," Working Papers hal-04066544, HAL.
    10. Sifat A Moon & Lee W Cohnstaedt & D Scott McVey & Caterina M Scoglio, 2019. "A spatio-temporal individual-based network framework for West Nile virus in the USA: Spreading pattern of West Nile virus," PLOS Computational Biology, Public Library of Science, vol. 15(3), pages 1-24, March.
    11. Tracy L Stepien & Holley E Lynch & Shirley X Yancey & Laura Dempsey & Lance A Davidson, 2019. "Using a continuum model to decipher the mechanics of embryonic tissue spreading from time-lapse image sequences: An approximate Bayesian computation approach," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-23, June.
    12. Ye Chen & Ilya O. Ryzhov, 2020. "Technical Note—Consistency Analysis of Sequential Learning Under Approximate Bayesian Inference," Operations Research, INFORMS, vol. 68(1), pages 295-307, January.
    13. Warne, David J. & Baker, Ruth E. & Simpson, Matthew J., 2018. "Multilevel rejection sampling for approximate Bayesian computation," Computational Statistics & Data Analysis, Elsevier, vol. 124(C), pages 71-86.
    14. Katharina Nöh & Sebastian Niedenführ & Martin Beyß & Wolfgang Wiechert, 2018. "A Pareto approach to resolve the conflict between information gain and experimental costs: Multiple-criteria design of carbon labeling experiments," PLOS Computational Biology, Public Library of Science, vol. 14(10), pages 1-30, October.
    15. Wilkinson Richard David, 2013. "Approximate Bayesian computation (ABC) gives exact results under the assumption of model error," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(2), pages 129-141, May.
    16. Sebastian Calcetero-Vanegas & Andrei L. Badescu & X. Sheldon Lin, 2022. "Effective a Posteriori Ratemaking with Large Insurance Portfolios via Surrogate Modeling," Papers 2211.06568, arXiv.org, revised May 2023.
    17. Ljubisa Miskovic & Jonas Béal & Michael Moret & Vassily Hatzimanikatis, 2019. "Uncertainty reduction in biochemical kinetic models: Enforcing desired model properties," PLOS Computational Biology, Public Library of Science, vol. 15(8), pages 1-29, August.
    18. Thembi Mdluli & Gregery T Buzzard & Ann E Rundell, 2015. "Efficient Optimization of Stimuli for Model-Based Design of Experiments to Resolve Dynamical Uncertainty," PLOS Computational Biology, Public Library of Science, vol. 11(9), pages 1-23, September.
    19. Johnston Iain G., 2014. "Efficient parametric inference for stochastic biological systems with measured variability," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(3), pages 1-12, June.
    20. Yi Liu & Veronika Ročková & Yuexi Wang, 2021. "Variable selection with ABC Bayesian forests," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(3), pages 453-481, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003650. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.