IDEAS home Printed from https://ideas.repec.org/a/plo/pwat00/0000040.html
   My bibliography  Save this article

Modelling point-of-consumption residual chlorine in humanitarian response: Can cost-sensitive learning improve probabilistic forecasts?

Author

Listed:
  • Michael De Santi
  • Syed Imran Ali
  • Matthew Arnold
  • Jean-François Fesselet
  • Anne M J Hyvärinen
  • Dawn Taylor
  • Usman T Khan

Abstract

Ensuring sufficient free residual chlorine (FRC) up to the time and place water is consumed in refugee settlements is essential for preventing the spread of waterborne illnesses. Water system operators need accurate forecasts of FRC during the household storage period. However, factors that drive FRC decay after water leaves the piped distribution system vary substantially, introducing significant uncertainty when modelling point-of-consumption FRC. Artificial neural network (ANN) ensemble forecasting systems (EFS) can account for this uncertainty by generating probabilistic forecasts of point-of-consumption FRC. ANNs are typically trained using symmetrical error metrics like mean squared error (MSE), but this leads to forecast underdispersion forecasts (the spread of the forecast is smaller than the spread of the observations). This study proposes to solve forecast underdispersion by training an ANN-EFS using cost functions that combine alternative metrics (Nash-Sutcliffe efficiency, Kling Gupta Efficiency, Index of Agreement) with cost-sensitive learning (inverse FRC weighting, class-based FRC weighting, inverse frequency weighting). The ANN-EFS trained with each cost function was evaluated using water quality data from refugee settlements in Bangladesh and Tanzania by comparing the percent capture, confidence interval reliability diagrams, rank histograms, and the continuous ranked probability. Training the ANN-EFS using the cost functions developed in this study produced up to a 70% improvement in forecast reliability and dispersion compared to the baseline cost function (MSE), with the best performance typically obtained by training the model using Kling-Gupta Efficiency and inverse frequency weighting. Our findings demonstrate that training the ANN-EFS using alternative metrics and cost-sensitive learning can improve the quality of forecasts of point-of-consumption FRC and better account for uncertainty in post-distribution chlorine decay. These techniques can enable humanitarian responders to ensure sufficient FRC more reliably at the point-of-consumption, thereby preventing the spread of waterborne illnesses.

Suggested Citation

  • Michael De Santi & Syed Imran Ali & Matthew Arnold & Jean-François Fesselet & Anne M J Hyvärinen & Dawn Taylor & Usman T Khan, 2022. "Modelling point-of-consumption residual chlorine in humanitarian response: Can cost-sensitive learning improve probabilistic forecasts?," PLOS Water, Public Library of Science, vol. 1(9), pages 1-30, September.
  • Handle: RePEc:plo:pwat00:0000040
    DOI: 10.1371/journal.pwat.0000040
    as

    Download full text from publisher

    File URL: https://journals.plos.org/water/article?id=10.1371/journal.pwat.0000040
    Download Restriction: no

    File URL: https://journals.plos.org/water/article/file?id=10.1371/journal.pwat.0000040&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pwat.0000040?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Tilmann Gneiting & Fadoua Balabdaoui & Adrian E. Raftery, 2007. "Probabilistic forecasts, calibration and sharpness," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(2), pages 243-268, April.
    2. Dress, Korbinian & Lessmann, Stefan & von Mettenheim, Hans-Jörg, 2018. "Residual value forecasting using asymmetric cost functions," International Journal of Forecasting, Elsevier, vol. 34(4), pages 551-565.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maneesoonthorn, Worapree & Martin, Gael M. & Forbes, Catherine S. & Grose, Simone D., 2012. "Probabilistic forecasts of volatility and its risk premia," Journal of Econometrics, Elsevier, vol. 171(2), pages 217-236.
    2. Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
    3. Mariangela Guidolin & Stefano Rizzelli, 2025. "Dynamic Forecasting of Gas Consumption in Selected European Countries," Forecasting, MDPI, vol. 7(2), pages 1-29, May.
    4. Nowotarski, Jakub & Weron, Rafał, 2018. "Recent advances in electricity price forecasting: A review of probabilistic forecasting," Renewable and Sustainable Energy Reviews, Elsevier, vol. 81(P1), pages 1548-1568.
    5. Tilmann Gneiting & Larissa Stanberry & Eric Grimit & Leonhard Held & Nicholas Johnson, 2008. "Rejoinder on: Assessing probabilistic forecasts of multivariate quantities, with an application to ensemble predictions of surface winds," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(2), pages 256-264, August.
    6. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    7. Bijak Jakub & Alberts Isabel & Alho Juha & Bryant John & Buettner Thomas & Falkingham Jane & Forster Jonathan J. & Gerland Patrick & King Thomas & Onorante Luca & Keilman Nico & O’Hagan Anthony & Owen, 2015. "Letter to the Editor," Journal of Official Statistics, Sciendo, vol. 31(4), pages 537-544, December.
    8. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    9. Wen, Honglin & Pinson, Pierre & Gu, Jie & Jin, Zhijian, 2024. "Wind energy forecasting with missing values within a fully conditional specification framework," International Journal of Forecasting, Elsevier, vol. 40(1), pages 77-95.
    10. Mike Ludkovski & Glen Swindle & Eric Grannan, 2022. "Large Scale Probabilistic Simulation of Renewables Production," Papers 2205.04736, arXiv.org.
    11. Ganics, Gergely & Odendahl, Florens, 2021. "Bayesian VAR forecasts, survey information, and structural change in the euro area," International Journal of Forecasting, Elsevier, vol. 37(2), pages 971-999.
    12. Geweke, John & Amisano, Gianni, 2011. "Optimal prediction pools," Journal of Econometrics, Elsevier, vol. 164(1), pages 130-141, September.
    13. Daniel Ambach & Robert Garthoff, 2016. "Vorhersagen der Windgeschwindigkeit und Windenergie in Deutschland," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 10(1), pages 15-36, February.
    14. Song, Haiyan & Wen, Long & Liu, Chang, 2019. "Density tourism demand forecasting revisited," Annals of Tourism Research, Elsevier, vol. 75(C), pages 379-392.
    15. Taylor, James W., 2017. "Probabilistic forecasting of wind power ramp events using autoregressive logit models," European Journal of Operational Research, Elsevier, vol. 259(2), pages 703-712.
    16. Ilaria Pia & Elina Numminen & Lari Veneranta & Jarno Vanhatalo, 2025. "Spatially Explicit Model to Disentangle Effects of Environment on Annual Fish Reproduction," Environmetrics, John Wiley & Sons, Ltd., vol. 36(2), March.
    17. Federico Bassetti & Roberto Casarin & Francesco Ravazzolo, 2018. "Bayesian Nonparametric Calibration and Combination of Predictive Distributions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 675-685, April.
    18. Antonio Bracale & Pasquale De Falco, 2015. "An Advanced Bayesian Method for Short-Term Probabilistic Forecasting of the Generation of Wind Power," Energies, MDPI, vol. 8(9), pages 1-22, September.
    19. Tsyplakov, Alexander, 2013. "Evaluation of Probabilistic Forecasts: Proper Scoring Rules and Moments," MPRA Paper 45186, University Library of Munich, Germany.
    20. Satopää, Ville A. & Baron, Jonathan & Foster, Dean P. & Mellers, Barbara A. & Tetlock, Philip E. & Ungar, Lyle H., 2014. "Combining multiple probability predictions using a simple logit model," International Journal of Forecasting, Elsevier, vol. 30(2), pages 344-356.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pwat00:0000040. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: water (email available below). General contact details of provider: https://journals.plos.org/water .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.