IDEAS home Printed from https://ideas.repec.org/a/spr/waterr/v39y2025i8d10.1007_s11269-025-04149-y.html
   My bibliography  Save this article

Annual Peak Runoff Forecasting Using Two-Stage Input Variable Selection-Aided k-Nearest-Neighbors Ensemble

Author

Listed:
  • Wei Sun

    (Sun Yat-Sen University
    Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai))

  • Decheng Zeng

    (GDH Feilaixia Hydropower Co Ltd)

  • Shu Chen

    (Sun Yat-Sen University
    Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai))

  • Miaomiao Ren

    (Sun Yat-Sen University
    Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai))

  • Yutong Xie

    (Sun Yat-Sen University
    Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai))

Abstract

Annual river forecasting is a useful tool for practical flood management. However, since it involves a problem with a small dataset, deep learning models with complex network structures are generally not applicable. In this study, two-stage input variable selection (IVS)-aided k-nearest-neighbors (kNN) ensemble models are developed for annual peak flow (APF) forecasting at the Lechangxia Reservoir in China. In the first stage, three correlation-based metrics (Pearson correlation coefficient, Spearman correlation coefficient, and mutual information index) are used to rank tele-connected indicators according to their linear and nonlinear relationships with historical APF. The top indicators identified through this filter method are forwarded to the second stage, where a leave-one-out cross-validation-based exhaustive search systematically evaluates all possible combinations of the retained inputs. The optimal kNN member uses the two-stage IVS method based on the Spearman correlation coefficient. Multiple kNN models are subsequently developed using distinct input subsets, and these models are aggregated through a simple average method to create the final ensemble forecast. The optimal ensemble model improves the validation R and RMSE of the optimal member model, by 17.5% and 16.1%, respectively. This study highlights the effectiveness of improving long-term river forecasting performances through integrating pre-processing (two-stage input variable selection) and post-processing (multi-strategy kNN ensemble) methods.

Suggested Citation

  • Wei Sun & Decheng Zeng & Shu Chen & Miaomiao Ren & Yutong Xie, 2025. "Annual Peak Runoff Forecasting Using Two-Stage Input Variable Selection-Aided k-Nearest-Neighbors Ensemble," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 39(8), pages 4135-4150, June.
  • Handle: RePEc:spr:waterr:v:39:y:2025:i:8:d:10.1007_s11269-025-04149-y
    DOI: 10.1007/s11269-025-04149-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11269-025-04149-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11269-025-04149-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Hadi Sanikhani & Mohammad Reza Nikpour & Fatemeh Jamshidi, 2025. "Advanced Framework for Predicting Rainfall-Runoff: Comparative Evaluation of AI Models for Enhanced Forecasting Accuracy," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 39(7), pages 3273-3294, May.
    2. Maha Shabbir & Sohail Chand & Farhat Iqbal, 2022. "A Novel Hybrid Method for River Discharge Prediction," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(1), pages 253-272, January.
    3. Anas Mahmood Al-Juboori, 2021. "A Hybrid Model to Predict Monthly Streamflow Using Neighboring Rivers Annual Flows," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(2), pages 729-743, January.
    4. Ahmad Khazaee Poul & Mojtaba Shourian & Hadi Ebrahimi, 2019. "A Comparative Study of MLR, KNN, ANN and ANFIS Models with Wavelet Transform in Monthly Stream Flow Prediction," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(8), pages 2907-2923, June.
    5. Salman Sharifazari & Shahab Araghinejad, 2015. "Development of a Nonparametric Model for Multivariate Hydrological Monthly Series Simulation Considering Climate Change Impacts," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(14), pages 5309-5322, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maha Shabbir & Sohail Chand & Farhat Iqbal & Ozgur Kisi, 2024. "Hybrid Approach for Streamflow Prediction: LASSO-Hampel Filter Integration with Support Vector Machines, Artificial Neural Networks, and Autoregressive Distributed Lag Models," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 38(11), pages 4179-4196, September.
    2. Wen-chuan Wang & Yu-jin Du & Kwok-wing Chau & Dong-mei Xu & Chang-jun Liu & Qiang Ma, 2021. "An Ensemble Hybrid Forecasting Model for Annual Runoff Based on Sample Entropy, Secondary Decomposition, and Long Short-Term Memory Neural Network," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(14), pages 4695-4726, November.
    3. Zhiqiang Jiang & Zhengyang Tang & Yi Liu & Yuyun Chen & Zhongkai Feng & Yang Xu & Hairong Zhang, 2019. "Area Moment and Error Based Forecasting Difficulty and its Application in Inflow Forecasting Level Evaluation," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(13), pages 4553-4568, October.
    4. Ali Najah Ahmed & Nguyen Van Thieu & Kai Lun Chong & Yuk Feng Huang & Ahmed El-Shafie, 2025. "A Comparative Analysis of Machine Learning Models for Simulating, Classifying, and Assessment River Inflow," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 39(8), pages 4051-4069, June.
    5. Wenxin Xu & Jie Chen & Xunchang J. Zhang, 2022. "Scale Effects of the Monthly Streamflow Prediction Using a State-of-the-art Deep Learning Model," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(10), pages 3609-3625, August.
    6. Hua Yan & Ming Liu & Bin Yang & Yang Yang & Hu Ni & Haoyu Wang, 2024. "Short-term forecasting approach of single well production based on multi-intelligent agent hybrid model," PLOS ONE, Public Library of Science, vol. 19(4), pages 1-18, April.
    7. Jincheng Zhou & Dan Wang & Shahab S. Band & Changhyun Jun & Sayed M. Bateni & M. Moslehpour & Hao-Ting Pai & Chung-Chian Hsu & Rasoul Ameri, 2023. "Monthly River Discharge Forecasting Using Hybrid Models Based on Extreme Gradient Boosting Coupled with Wavelet Theory and Lévy–Jaya Optimization Algorithm," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(10), pages 3953-3972, August.
    8. Chao Zhang, 2022. "Exploring user cognition difference and pleasure balance guidance method for product perceptible features in vehicle-mounted system," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(3), pages 1019-1030, December.
    9. Ana C. Cebrián & Ricardo Salillas, 2021. "Forecasting High-Frequency River Level Series Using Double Switching Regression with ARMA Errors," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(1), pages 299-313, January.
    10. Yun Bai & Nejc Bezak & Klaudija Sapač & Mateja Klun & Jin Zhang, 2019. "Short-Term Streamflow Forecasting Using the Feature-Enhanced Regression Model," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(14), pages 4783-4797, November.
    11. Hakan Tongal & Martijn J. Booij, 2016. "A Comparison of Nonlinear Stochastic Self-Exciting Threshold Autoregressive and Chaotic k-Nearest Neighbour Models in Daily Streamflow Forecasting," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 30(4), pages 1515-1531, March.
    12. Swades Pal & Pankaj Singha, 2025. "Hydrological alteration and its effect on the eco-hydrological state in Tangon river traversing Bangladesh and India," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 27(5), pages 10013-10039, May.
    13. Manish Kumar & Ahmed Elbeltagi & Chaitanya B. Pande & Ali Najah Ahmed & Ming Fai Chow & Quoc Bao Pham & Anuradha Kumari & Deepak Kumar, 2022. "Applications of Data-driven Models for Daily Discharge Estimation Based on Different Input Combinations," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(7), pages 2201-2221, May.
    14. Wei Li & Jianzhong Zhou & Huaiwei Sun & Kuaile Feng & Hairong Zhang & Muhammad Tayyab, 2017. "Impact of Distribution Type in Bayes Probability Flood Forecasting," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 31(3), pages 961-977, February.
    15. Subbarayan Saravanan & Nagireddy Masthan Reddy & Quoc Bao Pham & Abdullah Alodah & Hazem Ghassan Abdo & Hussein Almohamad & Ahmed Abdullah Al Dughairi, 2023. "Machine Learning Approaches for Streamflow Modeling in the Godavari Basin with CMIP6 Dataset," Sustainability, MDPI, vol. 15(16), pages 1-26, August.
    16. A. B. Dariane & M. I. Borhan, 2024. "Comparison of Classical and Machine Learning Methods in Estimation of Missing Streamflow Data," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 38(4), pages 1453-1478, March.
    17. Fatemeh Bakhshi Ostadkalayeh & Saba Moradi & Ali Asadi & Alireza Moghaddam Nia & Somayeh Taheri, 2023. "Performance Improvement of LSTM-based Deep Learning Model for Streamflow Forecasting Using Kalman Filtering," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(8), pages 3111-3127, June.
    18. Hossien Riahi-Madvar & Majid Dehghani & Rasoul Memarzadeh & Bahram Gharabaghi, 2021. "Short to Long-Term Forecasting of River Flows by Heuristic Optimization Algorithms Hybridized with ANFIS," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(4), pages 1149-1166, March.
    19. M. Rajesh & Sachdeva Anishka & Pansari Satyam Viksit & Srivastav Arohi & S. Rehana, 2023. "Improving Short-range Reservoir Inflow Forecasts with Machine Learning Model Combination," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(1), pages 75-90, January.
    20. Hakan Tongal & Martijn Booij, 2016. "A Comparison of Nonlinear Stochastic Self-Exciting Threshold Autoregressive and Chaotic k-Nearest Neighbour Models in Daily Streamflow Forecasting," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 30(4), pages 1515-1531, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:waterr:v:39:y:2025:i:8:d:10.1007_s11269-025-04149-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.