Printed from https://ideas.repec.org/a/eee/ecomod/v430y2020ics0304380020302076.html

Predicting lake surface water phosphorus dynamics using process-guided machine learning

Authors

  • Hanson, Paul C.
  • Stillman, Aviah B.
  • Jia, Xiaowei
  • Karpatne, Anuj
  • Dugan, Hilary A.
  • Carey, Cayelan C.
  • Stachelek, Joseph
  • Ward, Nicole K.
  • Zhang, Yu
  • Read, Jordan S.
  • Kumar, Vipin

Abstract

Phosphorus (P) loading to lakes is degrading the quality and usability of water globally. Accurate predictions of lake P dynamics are needed to understand whole-ecosystem P budgets, as well as the consequences of changing lake P concentrations for water quality. However, complex biophysical processes within lakes, along with limited observational data, challenge our capacity to reproduce short-term lake dynamics needed for water quality predictions, as well as long-term dynamics needed to understand broad-scale controls over lake P. Here we use an emerging paradigm in modeling, process-guided machine learning (PGML), to produce a phosphorus budget for Lake Mendota (Wisconsin, USA) and to accurately predict epilimnetic phosphorus over a time range of days to decades. In our implementation of PGML, which we term a Process-Guided Recurrent Neural Network (PGRNN), we combine a process-based model for lake P with a recurrent neural network, and then constrain the predictions with ecological principles. We independently test the process-based model, the recurrent neural network, and the PGRNN to evaluate the overall approach. The process-based model accounted for most of the observed pattern in lake P; however, it missed the long-term trend in lake P and had the worst performance in predicting winter and summer P in surface waters. The root mean square error (RMSE) for the process-based model, the recurrent neural network, and the PGRNN was 33.0 μg P L−1, 22.7 μg P L−1, and 20.7 μg P L−1, respectively. All models performed better during summer, with RMSE values for the three models (same order) equal to 14.3 μg P L−1, 10.9 μg P L−1, and 10.7 μg P L−1. Although the PGRNN had only marginally better RMSE during summer, it had lower bias and reproduced long-term decreases in lake P missed by the other two models.
For all seasons and all years, the recurrent neural network made better predictions than the process-based model alone, with RMSE of 23.8 μg P L−1 and 28.0 μg P L−1, respectively. The output of the PGRNN indicated that new processes related to water temperature, thermal stratification, and long-term changes in external loads are needed to improve the process model. By using ecological knowledge, as well as the information content of complex data, PGML shows promise as a technique for accurate prediction of messy, real-world ecological dynamics, while providing valuable information that can improve our understanding of process.
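
As an illustration only, the two quantities the abstract relies on can be sketched in Python: the RMSE used to compare the three models, and a training loss that adds a penalty when predictions violate a process-based constraint. The abstract does not state which ecological principles constrain the PGRNN or how they enter training, so the constraint used here (a cap on the change in predicted P between consecutive time steps) and the names `process_guided_loss`, `max_step_change`, and `weight` are assumptions made for this sketch, not the authors' method.

```python
import numpy as np


def rmse(observed, predicted):
    """Root mean square error, in the units of the inputs (e.g. ug P per L)."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((predicted - observed) ** 2)))


def process_guided_loss(observed, predicted, max_step_change, weight=1.0):
    """Mean squared error plus a penalty for breaking a process constraint.

    Hypothetical constraint (an assumption of this sketch): the absolute
    change in predicted P between consecutive time steps should not exceed
    max_step_change.
    """
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)

    # Data term: ordinary mean squared error against observations.
    mse = np.mean((predicted - observed) ** 2)

    # Process term: hinge on the step-to-step change, zero when the cap holds.
    step = np.abs(np.diff(predicted))
    violation = np.maximum(step - max_step_change, 0.0)
    penalty = np.mean(violation ** 2) if violation.size else 0.0

    return float(mse + weight * penalty)
```

With observations of 0, 0, 0 and predictions of 0, 0, 3 under a cap of 1, the squared-error term is 3 and the penalty term is 2, so the loss is 5. Raising `weight` pushes a fitted model toward predictions that respect the cap, at some cost in fit to the observations.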

Suggested Citation

  • Hanson, Paul C. & Stillman, Aviah B. & Jia, Xiaowei & Karpatne, Anuj & Dugan, Hilary A. & Carey, Cayelan C. & Stachelek, Joseph & Ward, Nicole K. & Zhang, Yu & Read, Jordan S. & Kumar, Vipin, 2020. "Predicting lake surface water phosphorus dynamics using process-guided machine learning," Ecological Modelling, Elsevier, vol. 430(C).
  • Handle: RePEc:eee:ecomod:v:430:y:2020:i:c:s0304380020302076
    DOI: 10.1016/j.ecolmodel.2020.109136

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304380020302076
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Franz Hamilton & Alun L Lloyd & Kevin B Flores, 2017. "Hybrid modeling and prediction of dynamical systems," PLOS Computational Biology, Public Library of Science, vol. 13(7), pages 1-20, July.
    2. Soetaert, Karline & Petzoldt, Thomas, 2010. "Inverse Modelling, Sensitivity and Monte Carlo Analysis in R Using Package FME," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i03).

    Citations

    Cited by:

    1. Diane A. Isabelle & Mika Westerlund, 2022. "A Review and Categorization of Artificial Intelligence-Based Opportunities in Wildlife, Ocean and Land Conservation," Sustainability, MDPI, vol. 14(4), pages 1-22, February.
    2. Laima Česonienė & Daiva Šileikienė & Vitas Marozas & Laura Čiteikė, 2021. "Influence of Anthropogenic Loads on Surface Water Status: A Case Study in Lithuania," Sustainability, MDPI, vol. 13(8), pages 1-15, April.
    3. Zhang, Xinru & Hou, Lei & Liu, Jiaquan & Yang, Kai & Chai, Chong & Li, Yanhao & He, Sichen, 2022. "Energy consumption prediction for crude oil pipelines based on integrating mechanism analysis and data mining," Energy, Elsevier, vol. 254(PB).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhou, W. & O’Neill, E. & Moncaster, A. & Reiner, D. & Guthrie, P., 2019. "Applying Bayesian Model Averaging to Characterise Urban Residential Stock Turnover Dynamics," Cambridge Working Papers in Economics 1986, Faculty of Economics, University of Cambridge.
    2. Hannah Al Ali & Alireza Daneshkhah & Abdesslam Boutayeb & Zindoga Mukandavire, 2022. "Examining Type 1 Diabetes Mathematical Models Using Experimental Data," IJERPH, MDPI, vol. 19(2), pages 1-20, January.
    3. Taffi, Marianna & Paoletti, Nicola & Liò, Pietro & Pucciarelli, Sandra & Marini, Mauro, 2015. "Bioaccumulation modelling and sensitivity analysis for discovering key players in contaminated food webs: The case study of PCBs in the Adriatic Sea," Ecological Modelling, Elsevier, vol. 306(C), pages 205-215.
    4. Lucash, Melissa S. & Marshall, Adrienne M. & Weiss, Shelby A. & McNabb, John W. & Nicolsky, Dmitry J. & Flerchinger, Gerald N. & Link, Timothy E. & Vogel, Jason G. & Scheller, Robert M. & Abramoff, Ro, 2023. "Burning trees in frozen soil: Simulating fire, vegetation, soil, and hydrology in the boreal forests of Alaska," Ecological Modelling, Elsevier, vol. 481(C).
    5. Meier, Laura & Brauns, Mario & Grimm, Volker & Weitere, Markus & Frank, Karin, 2022. "MASTIFF: A mechanistic model for cross-scale analyses of the functioning of multiple stressed riverine ecosystems," Ecological Modelling, Elsevier, vol. 470(C).
    6. Hussnain Mukhtar & Yu-Pin Lin & Oleg V. Shipin & Joy R. Petway, 2017. "Modeling Nitrogen Dynamics in a Waste Stabilization Pond System Using Flexible Modeling Environment with MCMC," IJERPH, MDPI, vol. 14(7), pages 1-15, July.
    7. Sehjeong Kim & Abdessamad Tridane, 2017. "Thalassemia in the United Arab Emirates: Why it can be prevented but not eradicated," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-13, January.
    8. Lee, Kyoungjae & Lee, Jaeyong & Dass, Sarat C., 2018. "Inference for differential equation models using relaxation via dynamical systems," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 116-134.
    9. Jinyoung Yang & Jeffrey S. Rosenthal, 2017. "Automatically tuned general-purpose MCMC via new adaptive diagnostics," Computational Statistics, Springer, vol. 32(1), pages 315-348, March.
    10. Soetaert, Karline & Petzoldt, Thomas & Setzer, R. Woodrow, 2010. "Solving Differential Equations in R: Package deSolve," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i09).
    11. McCullough, Ian M. & Dugan, Hilary A. & Farrell, Kaitlin J. & Morales-Williams, Ana M. & Ouyang, Zutao & Roberts, Derek & Scordo, Facundo & Bartlett, Sarah L. & Burke, Samantha M. & Doubek, Jonathan P, 2018. "Dynamic modeling of organic carbon fates in lake ecosystems," Ecological Modelling, Elsevier, vol. 386(C), pages 71-82.
    12. Venolia, Celeste T. & Lavaud, Romain & Green-Gavrielidis, Lindsay A. & Thornber, Carol & Humphries, Austin T., 2020. "Modeling the Growth of Sugar Kelp (Saccharina latissima) in Aquaculture Systems using Dynamic Energy Budget Theory," Ecological Modelling, Elsevier, vol. 430(C).
    13. Haas, Marcelo B. & Guse, Björn & Pfannerstill, Matthias & Fohrer, Nicola, 2015. "Detection of dominant nitrate processes in ecohydrological modeling with temporal parameter sensitivity analysis," Ecological Modelling, Elsevier, vol. 314(C), pages 62-72.
    14. Keane, Robert E. & McKenzie, Donald & Falk, Donald A. & Smithwick, Erica A.H. & Miller, Carol & Kellogg, Lara-Karena B., 2015. "Representing climate, disturbance, and vegetation interactions in landscape models," Ecological Modelling, Elsevier, vol. 309, pages 33-47.
    15. Shoya Iwanami & Kosaku Kitagawa & Hirofumi Ohashi & Yusuke Asai & Kaho Shionoya & Wakana Saso & Kazane Nishioka & Hisashi Inaba & Shinji Nakaoka & Takaji Wakita & Odo Diekmann & Shingo Iwami & Koichi , 2020. "Should a viral genome stay in the host cell or leave? A quantitative dynamics study of how hepatitis C virus deals with this dilemma," PLOS Biology, Public Library of Science, vol. 18(7), pages 1-17, July.
    16. Krishna, Shubham & Pahlow, Markus & Schartau, Markus, 2019. "Comparison of two carbon-nitrogen regulatory models calibrated with mesocosm data," Ecological Modelling, Elsevier, vol. 411(C).
    17. Raquel Martins Lana & Maíra Moreira Morais & Tiago França Melo de Lima & Tiago Garcia de Senna Carneiro & Lucas Martins Stolerman & Jefferson Pereira Caldas dos Santos & José Joaquín Carvajal Cortés &, 2018. "Assessment of a trap based Aedes aegypti surveillance program using mathematical modeling," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-16, January.
    18. Littfinski, Tobias & Stricker, Max & Nettmann, Edith & Gehring, Tito & Hiegemann, Heinz & Krimmler, Stefan & Lübken, Manfred & Pant, Deepak & Wichern, Marc, 2022. "A generalized whole-cell model for wastewater-fed microbial fuel cells," Applied Energy, Elsevier, vol. 321(C).
    19. Kankoé Sallah & Roch Giorgi & El-Hadj Ba & Martine Piarroux & Renaud Piarroux & Badara Cisse & Jean Gaudart, 2020. "Targeting Malaria Hotspots to Reduce Transmission Incidence in Senegal," IJERPH, MDPI, vol. 18(1), pages 1-17, December.
    20. Tom Shatwell & Jan Köhler & Andreas Nicklisch, 2014. "Temperature and Photoperiod Interactions with Phosphorus-Limited Growth and Competition of Two Diatoms," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-15, July.
