IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v15y2023i7p6083-d1113146.html
   My bibliography  Save this article

Time Series Data Preparation for Failure Prediction in Smart Water Taps (SWT)

Author

Listed:
  • Nsikak Mitchel Offiong

    (Centre for Water Systems, University of Exeter, Exeter EX4 4QF, UK)

  • Fayyaz Ali Memon

    (Centre for Water Systems, University of Exeter, Exeter EX4 4QF, UK)

  • Yulei Wu

    (Department of Computer Science, EMPS, University of Exeter, Exeter EX4 4QF, UK)

Abstract

Smart water tap (SWT) time series model development for failure prediction requires acquiring data on the variables of interest to researchers, planners, engineers and decision makers. Thus, the data are expected to be ‘noiseless’ (i.e., without discrepancies such as missing data, data redundancy and data duplication) raw inputs for modelling and forecasting tasks. However, historical datasets acquired from the SWTs contain data discrepancies that require preparation before applying the dataset to develop a failure prediction model. This paper presents a combination of the generative adversarial network (GAN) and the bidirectional gated recurrent unit (BiGRU) techniques for missing data imputation. The GAN aids in training the SWT data trend and distribution, enabling the imputed data to be closely similar to the historical dataset. On the other hand, the BiGRU was adopted to save computational time by combining the model’s cell state and hidden state during data imputation. After data imputation there were outliers, and the exponential smoothing method was used to balance the data. The result shows that this method can be applied in time series systems to correct missing values in a dataset, thereby mitigating data noise that can lead to a biased failure prediction model. Furthermore, when evaluated using different sets of historical SWT data, the method proved reliable for missing data imputation and achieved better training time than the traditional data imputation method.

Suggested Citation

  • Nsikak Mitchel Offiong & Fayyaz Ali Memon & Yulei Wu, 2023. "Time Series Data Preparation for Failure Prediction in Smart Water Taps (SWT)," Sustainability, MDPI, vol. 15(7), pages 1-15, March.
  • Handle: RePEc:gam:jsusta:v:15:y:2023:i:7:p:6083-:d:1113146
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/15/7/6083/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/15/7/6083/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hamori, Shigeyuki & Motegi, Kaiji & Zhang, Zheng, 2020. "Copula-based regression models with data missing at random," Journal of Multivariate Analysis, Elsevier, vol. 180(C).
    2. Sharon A. Jones & Kristen L. Sanford Bernhardt & Mark Kennedy & Kelsey Lantz & Trent Holden, 2013. "Collecting Critical Data to Assess the Sustainability of Rural Infrastructure in Low-Income Countries," Sustainability, MDPI, vol. 5(11), pages 1-19, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fekadu Megersa Senbeta & Yang Shu, 2019. "Project Implementation Management Modalities and Their Implications on Sustainability of Water Services in Rural Areas in Ethiopia: Are Community-Managed Projects More Effective?," Sustainability, MDPI, vol. 11(6), pages 1-19, March.
    2. Isabel Domínguez & Edgar Ricardo Oviedo-Ocaña & Karen Hurtado & Andrés Barón & Ralph P. Hall, 2019. "Assessing Sustainability in Rural Water Supply Systems in Developing Countries Using a Novel Tool Based on Multi-Criteria Analysis," Sustainability, MDPI, vol. 11(19), pages 1-22, September.
    3. Haijing Zhang & Qingyun Du & Min Yao & Fu Ren, 2016. "Evaluation and Clustering Maps of Groundwater Wells in the Red Beds of Chengdu, Sichuan, China," Sustainability, MDPI, vol. 8(1), pages 1-21, January.
    4. Boulin, Alexis & Di Bernardino, Elena & Laloë, Thomas & Toulemonde, Gwladys, 2022. "Non-parametric estimator of a multivariate madogram for missing-data and extreme value framework," Journal of Multivariate Analysis, Elsevier, vol. 192(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:15:y:2023:i:7:p:6083-:d:1113146. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.