IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1010988.html
   My bibliography  Save this article

Bridging the gap between mechanistic biological models and machine learning surrogates

Author

Listed:
  • Ioana M Gherman
  • Zahraa S Abdallah
  • Wei Pang
  • Thomas E Gorochowski
  • Claire S Grierson
  • Lucia Marucci

Abstract

Mechanistic models have been used for centuries to describe complex interconnected processes, including biological ones. As the scope of these models has widened, so have their computational demands. This complexity can limit their suitability when running many simulations or when real-time results are required. Surrogate machine learning (ML) models can be used to approximate the behaviour of complex mechanistic models, and once built, their computational demands are several orders of magnitude lower. This paper provides an overview of the relevant literature, both from an applicability and a theoretical perspective. For the latter, the paper focuses on the design and training of the underlying ML models. Application-wise, we show how ML surrogates have been used to approximate different mechanistic models. We present a perspective on how these approaches can be applied to models representing biological processes with potential industrial applications (e.g., metabolism and whole-cell modelling) and show why surrogate ML models may hold the key to making the simulation of complex biological systems possible using a typical desktop computer.

Suggested Citation

  • Ioana M Gherman & Zahraa S Abdallah & Wei Pang & Thomas E Gorochowski & Claire S Grierson & Lucia Marucci, 2023. "Bridging the gap between mechanistic biological models and machine learning surrogates," PLOS Computational Biology, Public Library of Science, vol. 19(4), pages 1-16, April.
  • Handle: RePEc:plo:pcbi00:1010988
    DOI: 10.1371/journal.pcbi.1010988
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1010988
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010988&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1010988?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Joshua Rees-Garbutt & Oliver Chalkley & Sophie Landon & Oliver Purcell & Lucia Marucci & Claire Grierson, 2020. "Designing minimal genomes using whole-cell models," Nature Communications, Nature, vol. 11(1), pages 1-12, December.
    2. Joshua Rees-Garbutt & Oliver Chalkley & Sophie Landon & Oliver Purcell & Lucia Marucci & Claire Grierson, 2020. "Author Correction: Designing minimal genomes using whole-cell models," Nature Communications, Nature, vol. 11(1), pages 1-1, December.
    3. W Andrew Pruett & Robert L Hester, 2016. "The Creation of Surrogate Models for Fast Estimation of Complex Model Outcomes," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-11, June.
    4. Deb, Chirag & Zhang, Fan & Yang, Junjing & Lee, Siew Eang & Shah, Kwok Wei, 2017. "A review on time series forecasting techniques for building energy consumption," Renewable and Sustainable Energy Reviews, Elsevier, vol. 74(C), pages 902-924.
    5. Marissa Renardy & Tau-Mu Yi & Dongbin Xiu & Ching-Shan Chou, 2018. "Parameter uncertainty quantification using surrogate models applied to a spatial model of yeast mating polarization," PLOS Computational Biology, Public Library of Science, vol. 14(5), pages 1-26, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simeon D. Castle & Michiel Stock & Thomas E. Gorochowski, 2024. "Engineering is evolution: a perspective on design processes to engineer biology," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    2. Alexandru Pîrjan & Simona-Vasilica Oprea & George Căruțașu & Dana-Mihaela Petroșanu & Adela Bâra & Cristina Coculescu, 2017. "Devising Hourly Forecasting Solutions Regarding Electricity Consumption in the Case of Commercial Center Type Consumers," Energies, MDPI, vol. 10(11), pages 1-36, October.
    3. Rao, Congjun & Zhang, Yue & Wen, Jianghui & Xiao, Xinping & Goh, Mark, 2023. "Energy demand forecasting in China: A support vector regression-compositional data second exponential smoothing model," Energy, Elsevier, vol. 263(PC).
    4. Nweye, Kingsley & Nagy, Zoltan, 2022. "MARTINI: Smart meter driven estimation of HVAC schedules and energy savings based on Wi-Fi sensing and clustering," Applied Energy, Elsevier, vol. 316(C).
    5. Deb, C. & Gelder, L.V. & Spiekman, M. & Pandraud, Guillaume & Jack, R. & Fitton, R., 2021. "Measuring the heat transfer coefficient (HTC) in buildings: A stakeholder's survey," Renewable and Sustainable Energy Reviews, Elsevier, vol. 144(C).
    6. Chou, Jui-Sheng & Tran, Duc-Son, 2018. "Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders," Energy, Elsevier, vol. 165(PB), pages 709-726.
    7. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    8. Ewa Chodakowska & Joanicjusz Nazarko & Łukasz Nazarko, 2021. "ARIMA Models in Electrical Load Forecasting and Their Robustness to Noise," Energies, MDPI, vol. 14(23), pages 1-22, November.
    9. Peng, Shiliang & Fan, Lin & Zhang, Li & Su, Huai & He, Yuxuan & He, Qian & Wang, Xiao & Yu, Dejun & Zhang, Jinjun, 2024. "Spatio-temporal prediction of total energy consumption in multiple regions using explainable deep neural network," Energy, Elsevier, vol. 301(C).
    10. Wulfran Fendzi Mbasso & Reagan Jean Jacques Molu & Serge Raoul Dzonde Naoussi & Saatong Kenfack, 2022. "Demand-Supply Forecasting based on Deep Learning for Electricity Balance in Cameroon," International Journal of Energy Economics and Policy, Econjournals, vol. 12(4), pages 99-103, July.
    11. Wang, Qiang & Jiang, Feng, 2019. "Integrating linear and nonlinear forecasting techniques based on grey theory and artificial intelligence to forecast shale gas monthly production in Pennsylvania and Texas of the United States," Energy, Elsevier, vol. 178(C), pages 781-803.
    12. Sarabia Escriva, Emilio José & Hart, Matthew & Acha, Salvador & Soto Francés, Víctor & Shah, Nilay & Markides, Christos N., 2022. "Techno-economic evaluation of integrated energy systems for heat recovery applications in food retail buildings," Applied Energy, Elsevier, vol. 305(C).
    13. Hu, Yuntong & Xiao, Fuyuan, 2022. "A novel method for forecasting time series based on directed visibility graph and improved random walk," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 594(C).
    14. Yoon, Y. & Jung, S. & Im, P. & Salonvaara, M. & Bhandari, M. & Kunwar, N., 2023. "Empirical validation of building energy simulation model input parameter for multizone commercial building during the cooling season," Renewable and Sustainable Energy Reviews, Elsevier, vol. 188(C).
    15. Nazir, Kashif & Memon, Shazim Ali & Saurbayeva, Assemgul, 2024. "A novel framework for developing a machine learning-based forecasting model using multi-stage sensitivity analysis to predict the energy consumption of PCM-integrated building," Applied Energy, Elsevier, vol. 376(PA).
    16. Guillaume Guerard & Hugo Pousseur & Ihab Taleb, 2021. "Isolated Areas Consumption Short-Term Forecasting Method," Energies, MDPI, vol. 14(23), pages 1-23, November.
    17. Di Natale, L. & Svetozarevic, B. & Heer, P. & Jones, C.N., 2023. "Towards scalable physically consistent neural networks: An application to data-driven multi-zone thermal building models," Applied Energy, Elsevier, vol. 340(C).
    18. Aida Boudhaouia & Patrice Wira, 2021. "A Real-Time Data Analysis Platform for Short-Term Water Consumption Forecasting with Machine Learning," Forecasting, MDPI, vol. 3(4), pages 1-13, September.
    19. Afroz, Zakia & Urmee, Tania & Shafiullah, G.M. & Higgins, Gary, 2018. "Real-time prediction model for indoor temperature in a commercial building," Applied Energy, Elsevier, vol. 231(C), pages 29-53.
    20. Mohammad Navid Fekri & Ananda Mohon Ghosh & Katarina Grolinger, 2019. "Generating Energy Data for Machine Learning with Recurrent Generative Adversarial Networks," Energies, MDPI, vol. 13(1), pages 1-23, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1010988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.