IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v14y2022i19p12286-d926977.html
   My bibliography  Save this article

Research on Data Cleaning Algorithm Based on Multi Type Construction Waste

Author

Listed:
  • Pengfei Wang

    (School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, No.15 Yongyuan Rd., Daxing District, Beijing 102616, China)

  • Yang Liu

    (School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, No.15 Yongyuan Rd., Daxing District, Beijing 102616, China
    Beijing Advanced Innovation Center for Future Urban Design, Beijing University of Civil Engineering and Architecture, No.1 Zhanlanguan Rd., Xicheng District, Beijing 100044, China
    Beijing Key Laboratory of Urban Spatial Information Engineering, No.15 Yangfangdian Rd., Haidian District, Beijing 100038, China)

  • Qinqin Sun

    (Beijing Key Laboratory of Urban Spatial Information Engineering, No.15 Yangfangdian Rd., Haidian District, Beijing 100038, China)

  • Yingqi Bai

    (Beijing Key Laboratory of Urban Spatial Information Engineering, No.15 Yangfangdian Rd., Haidian District, Beijing 100038, China)

  • Chaopeng Li

    (Beijing Key Laboratory of Urban Spatial Information Engineering, No.15 Yangfangdian Rd., Haidian District, Beijing 100038, China)

Abstract

Owing to urbanization, the output of construction waste is increasing yearly. Garbage treatment plays a vital role in urban development and construction. The accuracy and integrity of data are important for the implementation of construction waste treatment. Abnormal detection and incomplete filling occur when traditional cleaning algorithms are used. To improve the cleaning of construction waste data, a data cleaning algorithm based on multi-type construction waste was presented in this study. First, a multi-algorithm constraint model was designed to achieve accurate matching between the cleaning content and cleaning model. Thereafter, a natural language data cleaning model was proposed, and the spatial location data were separated from the general data through the content separation mechanism to effectively frame the area to be cleaned. Finally, a time series data cleaning model was constructed. By integrating “check” and “fill”, large-span and large-capacity time series data cleaning was realized. This algorithm was applied to the data collected by the pilot cities, which had precision and recall rates of 93.87% and 97.90% respectively, compared with the traditional algorithm, ultimately exhibiting a certain progressiveness. The algorithm proposed herein can be applied to urban environmental governance. Furthermore, this algorithm can markedly improve the control ability and work efficiency of construction waste treatment, and reduce the restriction of construction waste on the sustainable development of urban environments.

Suggested Citation

  • Pengfei Wang & Yang Liu & Qinqin Sun & Yingqi Bai & Chaopeng Li, 2022. "Research on Data Cleaning Algorithm Based on Multi Type Construction Waste," Sustainability, MDPI, vol. 14(19), pages 1-16, September.
  • Handle: RePEc:gam:jsusta:v:14:y:2022:i:19:p:12286-:d:926977
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/14/19/12286/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/14/19/12286/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chung, Won Hee & Gu, Yeong Hyeon & Yoo, Seong Joon, 2022. "District heater load forecasting based on machine learning and parallel CNN-LSTM attention," Energy, Elsevier, vol. 246(C).
    2. Ahmed M. A. Shohda & Mahrous A. M. Ali & Gaofeng Ren & Jong-Gwan Kim & Ahmed M. Abdo & Wael R. Abdellah & Abbas M. Hassan, 2022. "Sustainable Assignment of Egyptian Ornamental Stones for Interior and Exterior Building Finishes Using the AHP-TOPSIS Technique," Sustainability, MDPI, vol. 14(4), pages 1-17, February.
    3. Jan Polcyn, 2022. "Determining Value Added Intellectual Capital (VAIC) Using the TOPSIS-CRITIC Method in Small and Medium-Sized Farms in Selected European Countries," Sustainability, MDPI, vol. 14(6), pages 1-20, March.
    4. Olamide Jogunola & Bamidele Adebisi & Khoa Van Hoang & Yakubu Tsado & Segun I. Popoola & Mohammad Hammoudeh & Raheel Nawaz, 2022. "CBLSTM-AE: A Hybrid Deep Learning Framework for Predicting Energy Consumption," Energies, MDPI, vol. 15(3), pages 1-16, January.
    5. Bo Zeng & Yuxiang Sun & Shaojun Xie, 2022. "Application of LSTM algorithm combined with Kalman filter and SOGI in phase-locked technology of aviation variable frequency power supply," PLOS ONE, Public Library of Science, vol. 17(4), pages 1-16, April.
    6. Jae-Sang Hwang & Sung-Duk Mun & Tae-Joon Kim & Geun-Won Oh & Yeon-Sub Sim & Seung Jin Chang, 2022. "Development of Data Cleaning and Integration Algorithm for Asset Management of Power System," Energies, MDPI, vol. 15(5), pages 1-18, February.
    7. Cui-Bin Ji & Gui-Jiang Duan & Jun-Yan Zhou & Wei-Jie Xuan & Xianming Zhang, 2021. "Equipment Quality Data Integration and Cleaning Based on Multiterminal Collaboration," Complexity, Hindawi, vol. 2021, pages 1-16, October.
    8. Long, Huan & Xu, Shaohui & Gu, Wei, 2022. "An abnormal wind turbine data cleaning algorithm based on color space conversion and image feature detection," Applied Energy, Elsevier, vol. 311(C).
    9. Jin Sol Hwang & Jung-Su Kim & Hwachang Song, 2022. "Handling Load Uncertainty during On-Peak Time via Dual ESS and LSTM with Load Data Augmentation," Energies, MDPI, vol. 15(9), pages 1-20, April.
    10. Vaishnudebi Dutta & Subhomoy Haldar & Prabjot Kaur & Yuvraj Gajpal & Qingyuan Zhu, 2022. "Comparative Analysis of TOPSIS and TODIM for the Performance Evaluation of Foreign Players in Indian Premier League," Complexity, Hindawi, vol. 2022, pages 1-20, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Katerina Zdravkova, 2023. "Personalized Education for Sustainable Development," Sustainability, MDPI, vol. 15(8), pages 1-13, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yasher Ali & Osman Khalid & Imran Ali Khan & Syed Sajid Hussain & Faisal Rehman & Sajid Siraj & Raheel Nawaz, 2022. "A hybrid group-based movie recommendation framework with overlapping memberships," PLOS ONE, Public Library of Science, vol. 17(3), pages 1-28, March.
    2. Ling, Jihong & Zhang, Bingyang & Dai, Na & Xing, Jincheng, 2023. "Coupling input feature construction methods and machine learning algorithms for hourly secondary supply temperature prediction," Energy, Elsevier, vol. 278(C).
    3. Huang, Guizao & Wu, Guangning & Yang, Zefeng & Chen, Xing & Wei, Wenfu, 2023. "Development of surrogate models for evaluating energy transfer quality of high-speed railway pantograph-catenary system using physics-based model and machine learning," Applied Energy, Elsevier, vol. 333(C).
    4. Wang, Chang & Zheng, Jianqin & Liang, Yongtu & Wang, Bohong & Klemeš, Jiří Jaromír & Zhu, Zhu & Liao, Qi, 2022. "Deeppipe: An intelligent monitoring framework for operating condition of multi-product pipelines," Energy, Elsevier, vol. 261(PB).
    5. Runge, Jason & Saloux, Etienne, 2023. "A comparison of prediction and forecasting artificial intelligence models to estimate the future energy demand in a district heating system," Energy, Elsevier, vol. 269(C).
    6. Jun Dong & Xihao Dou & Aruhan Bao & Yaoyu Zhang & Dongran Liu, 2022. "Day-Ahead Spot Market Price Forecast Based on a Hybrid Extreme Learning Machine Technique: A Case Study in China," Sustainability, MDPI, vol. 14(13), pages 1-24, June.
    7. Ummu Salma Al-Azizah & Bagus Pamungkas Wibowo, 2023. "Impact of Intellectual Capital on Financial Performance: Panel Evidence from Banking Industry in Indonesia," Economic Studies journal, Bulgarian Academy of Sciences - Economic Research Institute, issue 5, pages 51-65.
    8. Kostadin Yotov & Emil Hadzhikolev & Stanka Hadzhikoleva & Stoyan Cheresharov, 2022. "Neuro-Cybernetic System for Forecasting Electricity Consumption in the Bulgarian National Power System," Sustainability, MDPI, vol. 14(17), pages 1-18, September.
    9. Shakeel, Asim & Chong, Daotong & Wang, Jinshi, 2023. "Load forecasting of district heating system based on improved FB-Prophet model," Energy, Elsevier, vol. 278(C).
    10. Gonçalves, Rui & Ribeiro, Vitor Miguel & Pereira, Fernando Lobo, 2023. "Variable Split Convolutional Attention: A novel Deep Learning model applied to the household electric power consumption," Energy, Elsevier, vol. 274(C).
    11. Zhang, Dongxue & Wang, Shuai & Liang, Yuqiu & Du, Zhiyuan, 2023. "A novel combined model for probabilistic load forecasting based on deep learning and improved optimizer," Energy, Elsevier, vol. 264(C).
    12. Yao, Qingtao & Zhu, Haowei & Xiang, Ling & Su, Hao & Hu, Aijun, 2023. "A novel composed method of cleaning anomy data for improving state prediction of wind turbine," Renewable Energy, Elsevier, vol. 204(C), pages 131-140.
    13. Chengming Zuo & Juchuan Dai & Guo Li & Mimi Li & Fan Zhang, 2023. "Investigation of Data Pre-Processing Algorithms for Power Curve Modeling of Wind Turbines Based on ECC," Energies, MDPI, vol. 16(6), pages 1-24, March.
    14. Wang, Peng & Li, Yanting & Zhang, Guangyao, 2023. "Probabilistic power curve estimation based on meteorological factors and density LSTM," Energy, Elsevier, vol. 269(C).
    15. Marta Moure-Garrido & Celeste Campo & Carlos Garcia-Rubio, 2022. "Entropy-Based Anomaly Detection in Household Electricity Consumption," Energies, MDPI, vol. 15(5), pages 1-21, March.
    16. Xiangqing Yin & Yi Liu & Li Yang & Wenchao Gao, 2022. "Abnormal Data Cleaning Method for Wind Turbines Based on Constrained Curve Fitting," Energies, MDPI, vol. 15(17), pages 1-22, August.
    17. Jiarong Shi & Zhiteng Wang, 2022. "A Hybrid Forecast Model for Household Electric Power by Fusing Landmark-Based Spectral Clustering and Deep Learning," Sustainability, MDPI, vol. 14(15), pages 1-21, July.
    18. Li, Guannan & Li, Fan & Ahmad, Tanveer & Liu, Jiangyan & Li, Tao & Fang, Xi & Wu, Yubei, 2022. "Performance evaluation of sequence-to-sequence-Attention model for short-term multi-step ahead building energy predictions," Energy, Elsevier, vol. 259(C).
    19. Sekhar, Charan & Dahiya, Ratna, 2023. "Robust framework based on hybrid deep learning approach for short term load forecasting of building electricity demand," Energy, Elsevier, vol. 268(C).
    20. Mario Pérez-Gomariz & Antonio López-Gómez & Fernando Cerdán-Cartagena, 2023. "Artificial Neural Networks as Artificial Intelligence Technique for Energy Saving in Refrigeration Systems—A Review," Clean Technol., MDPI, vol. 5(1), pages 1-21, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:14:y:2022:i:19:p:12286-:d:926977. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.