IDEAS home Printed from https://ideas.repec.org/a/spr/waterr/v39y2025i6d10.1007_s11269-024-04074-6.html
   My bibliography  Save this article

Comparison of Multilayer Perceptron with an Optimal Activation Function and Long Short-Term Memory for Rainfall-Runoff Simulations and Ungauged Catchment Runoff Prediction

Author

Listed:
  • Mun-Ju Shin

    (Jeju Special Self-Governing Province Development Corporation)

  • Yong Jung

    (Wonkwang University)

Abstract

Artificial intelligence (AI) models have recently become increasingly prevalent in hydrological research. Multilayer perceptron (MLP) and long short-term memory (LSTM) are the most popular AI models for rainfall-runoff modeling. LSTM, a recurrent neural network, is more commonly used than MLP, which is a feedforward method, owing to its ability to retain important information from previous data that can support simulations. However, a suitable activation function can overcome this issue when the MLP model is used. In this study, we aimed to identify the best activation function for the MLP model and evaluate its applicability for the prediction in ungauged catchments. We selected seven catchments widely located in Korea, with sizes ranging from 763–6648 km2. To achieve our objectives, we applied and analyzed five activation functions. The performance of the optimal MLP model was compared with that of the LSTM model to assess the ability of MLP to simulate rainfall-runoff and predict runoff in ungauged catchments. Consequently, we used the exponential linear unit (ELU) activation function to develop the MLP-ELU model, which was the most suitable for simulating runoff. The performance of the MLP-ELU model in simulating rainfall-runoff was similar to that of the LSTM model. The prediction performance of runoff of the MLP-ELU model in ungauged catchments was high (Nash–Sutcliffe efficiency ranged from 0.83–0.93 in the seven catchments). Simulated runoff using parameter values estimated for different catchments yielded results that were comparable to those of the calibrated catchment. Therefore, if suitable activation functions are applied, the MLP model can be used for rainfall-runoff studies and water resource design in ungauged catchments. Additionally, the use of MLP models can broaden the range of AI options for hydrology research.

Suggested Citation

  • Mun-Ju Shin & Yong Jung, 2025. "Comparison of Multilayer Perceptron with an Optimal Activation Function and Long Short-Term Memory for Rainfall-Runoff Simulations and Ungauged Catchment Runoff Prediction," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 39(6), pages 2479-2502, April.
  • Handle: RePEc:spr:waterr:v:39:y:2025:i:6:d:10.1007_s11269-024-04074-6
    DOI: 10.1007/s11269-024-04074-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11269-024-04074-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11269-024-04074-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bo Pang & Erik Nijkamp & Ying Nian Wu, 2020. "Deep Learning With TensorFlow: A Review," Journal of Educational and Behavioral Statistics, , vol. 45(2), pages 227-248, April.
    2. Richard H. R. Hahnloser & Rahul Sarpeshkar & Misha A. Mahowald & Rodney J. Douglas & H. Sebastian Seung, 2000. "Correction: Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit," Nature, Nature, vol. 408(6815), pages 1012-1012, December.
    3. Abhinav Kumar Singh & Pankaj Kumar & Rawshan Ali & Nadhir Al-Ansari & Dinesh Kumar Vishwakarma & Kuldeep Singh Kushwaha & Kanhu Charan Panda & Atish Sagar & Ehsan Mirzania & Ahmed Elbeltagi & Alban Ku, 2022. "An Integrated Statistical-Machine Learning Approach for Runoff Prediction," Sustainability, MDPI, vol. 14(13), pages 1-30, July.
    4. Hang Ha & Chinh Luu & Quynh Duy Bui & Duy-Hoa Pham & Tung Hoang & Viet-Phuong Nguyen & Minh Tuan Vu & Binh Thai Pham, 2021. "Flash flood susceptibility prediction mapping for a road network using hybrid machine learning models," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 109(1), pages 1247-1270, October.
    5. Adisa Hammed Akinsoji & Bashir Adelodun & Qudus Adeyi & Rahmon Abiodun Salau & Golden Odey & Kyung Sook Choi, 2024. "Integrating Machine Learning Models with Comprehensive Data Strategies and Optimization Techniques to Enhance Flood Prediction Accuracy: A Review," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 38(12), pages 4735-4761, September.
    6. Xin Yang & Jianzhong Zhou & Qianyi Zhang & Zhanxin Xu & Jianyun Zhang, 2024. "Evaluation and Interpretation of Runoff Forecasting Models Based on Hybrid Deep Neural Networks," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 38(6), pages 1987-2013, April.
    7. Richard H. R. Hahnloser & Rahul Sarpeshkar & Misha A. Mahowald & Rodney J. Douglas & H. Sebastian Seung, 2000. "Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit," Nature, Nature, vol. 405(6789), pages 947-951, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Wen & Yan, Shaoshan & Li, Jian & Tian, Xin & Yoshida, Taketoshi, 2022. "Credit risk prediction of SMEs in supply chain finance by fusing demographic and behavioral data," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 158(C).
    2. Oostwal, Elisa & Straat, Michiel & Biehl, Michael, 2021. "Hidden unit specialization in layered neural networks: ReLU vs. sigmoidal activation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 564(C).
    3. Phattara Khumprom & Nita Yodo, 2019. "A Data-Driven Predictive Prognostic Model for Lithium-ion Batteries based on a Deep Learning Algorithm," Energies, MDPI, vol. 12(4), pages 1-21, February.
    4. Kei Nakagawa & Masaya Abe & Junpei Komiyama, 2019. "A Robust Transferable Deep Learning Framework for Cross-sectional Investment Strategy," Papers 1910.01491, arXiv.org.
    5. Baiti-Ahmad Awaluddin & Chun-Tang Chao & Juing-Shian Chiou, 2023. "Investigating Effective Geometric Transformation for Image Augmentation to Improve Static Hand Gestures with a Pre-Trained Convolutional Neural Network," Mathematics, MDPI, vol. 11(23), pages 1-23, November.
    6. Ke, Shaowei & Zhao, Chen, 2024. "From local utility to neural networks," Journal of Mathematical Economics, Elsevier, vol. 113(C).
    7. Bodendorf, Frank & Xie, Qiao & Merkl, Philipp & Franke, Jörg, 2022. "A multi-perspective approach to support collaborative cost management in supplier-buyer dyads," International Journal of Production Economics, Elsevier, vol. 245(C).
    8. Ahmed A Metwally & Philip S Yu & Derek Reiman & Yang Dai & Patricia W Finn & David L Perkins, 2019. "Utilizing longitudinal microbiome taxonomic profiles to predict food allergy via Long Short-Term Memory networks," PLOS Computational Biology, Public Library of Science, vol. 15(2), pages 1-16, February.
    9. Kim, Seongsu & Kim, Junghwan, 2023. "Assessing fuel economy and NOx emissions of a hydrogen engine bus using neural network algorithms for urban mass transit systems," Energy, Elsevier, vol. 275(C).
    10. Anshun Su & Zehua Du & Bo Yin, 2024. "Non-invasive load monitoring based on deep learning to identify unknown loads," PLOS ONE, Public Library of Science, vol. 19(2), pages 1-20, February.
    11. Maoz Shamir, 2009. "The Temporal Winner-Take-All Readout," PLOS Computational Biology, Public Library of Science, vol. 5(2), pages 1-13, February.
    12. Joanne C Wen & Cecilia S Lee & Pearse A Keane & Sa Xiao & Ariel S Rokem & Philip P Chen & Yue Wu & Aaron Y Lee, 2019. "Forecasting future Humphrey Visual Fields using deep learning," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-14, April.
    13. Jingfen Lan & Ziheng Liao & A. K. Alvi Haque & Qiang Yu & Kun Xie & Yang Guo, 2024. "CNVbd: A Method for Copy Number Variation Detection and Boundary Search," Mathematics, MDPI, vol. 12(3), pages 1-15, January.
    14. Richter, Lucas & Lehna, Malte & Marchand, Sophie & Scholz, Christoph & Dreher, Alexander & Klaiber, Stefan & Lenk, Steve, 2022. "Artificial Intelligence for Electricity Supply Chain automation," Renewable and Sustainable Energy Reviews, Elsevier, vol. 163(C).
    15. González-Muñiz, Ana & Díaz, Ignacio & Cuadrado, Abel A. & García-Pérez, Diego, 2022. "Health indicator for machine condition monitoring built in the latent space of a deep autoencoder," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
    16. Hancock, Thomas O. & Broekaert, Jan & Hess, Stephane & Choudhury, Charisma F., 2020. "Quantum probability: A new method for modelling travel behaviour," Transportation Research Part B: Methodological, Elsevier, vol. 139(C), pages 165-198.
    17. Bernhard Nessler & Michael Pfeiffer & Lars Buesing & Wolfgang Maass, 2013. "Bayesian Computation Emerges in Generic Cortical Microcircuits through Spike-Timing-Dependent Plasticity," PLOS Computational Biology, Public Library of Science, vol. 9(4), pages 1-30, April.
    18. Artem Kuriksha, 2021. "An Economy of Neural Networks: Learning from Heterogeneous Experiences," Papers 2110.11582, arXiv.org.
    19. Li Zhang & Qun Hao & Jie Cao, 2023. "Attention-Based Fine-Grained Lightweight Architecture for Fuji Apple Maturity Classification in an Open-World Orchard Environment," Agriculture, MDPI, vol. 13(2), pages 1-20, January.
    20. Gianluca De Nard & Simon Hediger & Markus Leippold, 2022. "Subsampled factor models for asset pricing: The rise of Vasa," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1217-1247, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:waterr:v:39:y:2025:i:6:d:10.1007_s11269-024-04074-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.