IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i6p2918-d515909.html
   My bibliography  Save this article

A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters

Author

Listed:
  • Hai Sun

    (College of Engineering, Ocean University of China, Qingdao 266100, China
    Institute of Marine Development of the Ocean University of China, Ocean University of China, Qingdao 266100, China)

  • Jin Wang

    (College of Engineering, Ocean University of China, Qingdao 266100, China
    Institute of Marine Development of the Ocean University of China, Ocean University of China, Qingdao 266100, China)

  • Wentao Ye

    (School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China)

Abstract

The accurate prediction of storm surge disasters’ direct economic losses plays a positive role in providing critical support for disaster prevention decision-making and management. Previous researches on storm surge disaster loss assessment did not pay much attention to the overfitting phenomenon caused by the data scarcity and the excessive model complexity. To solve these problems, this paper puts forward a new evaluation system for forecasting the regional direct economic loss of storm surge disasters, consisting of three parts. First of all, a comprehensive assessment index system was established by considering the storm surge disasters’ formation mechanism and the corresponding risk management theory. Secondly, a novel data augmentation technique, k-nearest neighbor-Gaussian noise (KNN-GN), was presented to overcome data scarcity. Thirdly, an ensemble learning algorithm XGBoost as a regression model was utilized to optimize the results and produce the final forecasting results. To verify the best-combined model, KNN-GN-based XGBoost, we conducted cross-contrast experiments with several data augmentation techniques and some widely-used ensemble learning models. Meanwhile, the traditional prediction models are used as baselines to the optimized forecasting system. The experimental results show that the KNN-GN-based XGBoost model provides more precise predictions than the traditional models, with a 64.1% average improvement in the mean absolute percentage error (MAPE) measurement. It could be noted that the proposed evaluation system can be extended and applied to the geography-related field as well.

Suggested Citation

  • Hai Sun & Jin Wang & Wentao Ye, 2021. "A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters," IJERPH, MDPI, vol. 18(6), pages 1-23, March.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:6:p:2918-:d:515909
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/6/2918/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/6/2918/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jorge A. Ramirez & Michal Lichter & Tom J. Coulthard & Chris Skinner, 2016. "Hyper-resolution mapping of regional storm surge and tide flooding: comparison of static and dynamic models," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 82(1), pages 571-590, May.
    2. Sebastiaan N. Jonkman & Bob Maaskant & Ezra Boyd & Marc Lloyd Levitan, 2009. "Loss of Life Caused by the Flooding of New Orleans After Hurricane Katrina: Analysis of the Relationship Between Flood Characteristics and Mortality," Risk Analysis, John Wiley & Sons, vol. 29(5), pages 676-698, May.
    3. Tengjiao Guo & Guosheng Li, 2020. "Study on methods to identify the impact factors of economic losses due to typhoon storm surge based on confirmatory factor analysis," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 100(2), pages 515-534, January.
    4. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    5. Shuo Yang & Xin Liu & Qiang Liu, 2016. "A storm surge projection and disaster risk assessment model for China coastal areas," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 84(1), pages 649-667, October.
    6. Eric Dickson & Judy L. Baker & Daniel Hoornweg & Asmita Tiwari, 2012. "Urban Risk Assessments : Understanding Disaster and Climate Risk in Cities," World Bank Publications - Books, The World Bank Group, number 12356, December.
    7. Ji-Myong Kim & Kiyoung Son & Young-Jae Kim, 2019. "Assessing regional typhoon risk of disaster management by clustering typhoon paths," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 21(5), pages 2083-2096, October.
    8. Stefanos Stefanidis & Dimitrios Stathis, 2013. "Assessment of flood hazard based on natural and anthropogenic factors using analytic hierarchy process (AHP)," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 68(2), pages 569-585, September.
    9. Weiping Lou & Haiyan Chen & Xiaoling Shen & Ke Sun & Shengrong Deng, 2012. "Fine assessment of tropical cyclone disasters based on GIS and SVM in Zhejiang Province, China," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 64(1), pages 511-529, October.
    10. repec:wbk:wbpubs:12355 is not listed on IDEAS
    11. Wenjuan Sun & Paolo Bocchini & Brian D. Davison, 2020. "Applications of artificial intelligence for disaster management," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 103(3), pages 2631-2689, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shuxian Liu & Yang Liu & Zhigang Chu & Kun Yang & Guanlan Wang & Lisheng Zhang & Yuanda Zhang, 2023. "Evaluation of Tropical Cyclone Disaster Loss Using Machine Learning Algorithms with an eXplainable Artificial Intelligence Approach," Sustainability, MDPI, vol. 15(16), pages 1-17, August.
    2. Xiaotong Sui & Mingzhao Hu & Haoyun Wang & Lingdi Zhao, 2023. "Improved elasticity estimation model for typhoon storm surge losses in China," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 116(2), pages 2363-2381, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mansoor, Umer & Jamal, Arshad & Su, Junbiao & Sze, N.N. & Chen, Anthony, 2023. "Investigating the risk factors of motorcycle crash injury severity in Pakistan: Insights and policy recommendations," Transport Policy, Elsevier, vol. 139(C), pages 21-38.
    2. Bas Kolen & Matthijs Kok & Ira Helsloot & Bob Maaskant, 2013. "EvacuAid: A Probabilistic Model to Determine the Expected Loss of Life for Different Mass Evacuation Strategies During Flood Threats," Risk Analysis, John Wiley & Sons, vol. 33(7), pages 1312-1333, July.
    3. Ebrahim Ahmadisharaf & Alfred Kalyanapu & Eun-Sung Chung, 2015. "Evaluating the Effects of Inundation Duration and Velocity on Selection of Flood Management Alternatives Using Multi-Criteria Decision Making," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(8), pages 2543-2561, June.
    4. Bissan Ghaddar & Ignacio Gómez-Casares & Julio González-Díaz & Brais González-Rodríguez & Beatriz Pateiro-López & Sofía Rodríguez-Ballesteros, 2023. "Learning for Spatial Branching: An Algorithm Selection Approach," INFORMS Journal on Computing, INFORMS, vol. 35(5), pages 1024-1043, September.
    5. Akash Malhotra, 2018. "A hybrid econometric-machine learning approach for relative importance analysis: Prioritizing food policy," Papers 1806.04517, arXiv.org, revised Aug 2020.
    6. Moumita Palchaudhuri & Sujata Biswas, 2016. "Application of AHP with GIS in drought risk assessment for Puruliya district, India," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 84(3), pages 1905-1920, December.
    7. Nahushananda Chakravarthy H G & Karthik M Seenappa & Sujay Raghavendra Naganna & Dayananda Pruthviraja, 2023. "Machine Learning Models for the Prediction of the Compressive Strength of Self-Compacting Concrete Incorporating Incinerated Bio-Medical Waste Ash," Sustainability, MDPI, vol. 15(18), pages 1-22, September.
    8. Tim Voigt & Martin Kohlhase & Oliver Nelles, 2021. "Incremental DoE and Modeling Methodology with Gaussian Process Regression: An Industrially Applicable Approach to Incorporate Expert Knowledge," Mathematics, MDPI, vol. 9(19), pages 1-26, October.
    9. Wen, Shaoting & Buyukada, Musa & Evrendilek, Fatih & Liu, Jingyong, 2020. "Uncertainty and sensitivity analyses of co-combustion/pyrolysis of textile dyeing sludge and incense sticks: Regression and machine-learning models," Renewable Energy, Elsevier, vol. 151(C), pages 463-474.
    10. Zhu, Haibin & Bai, Lu & He, Lidan & Liu, Zhi, 2023. "Forecasting realized volatility with machine learning: Panel data perspective," Journal of Empirical Finance, Elsevier, vol. 73(C), pages 251-271.
    11. Spiliotis, Evangelos & Makridakis, Spyros & Kaltsounis, Anastasios & Assimakopoulos, Vassilios, 2021. "Product sales probabilistic forecasting: An empirical evaluation using the M5 competition data," International Journal of Production Economics, Elsevier, vol. 240(C).
    12. Zhang, Ning & Li, Zhiying & Zou, Xun & Quiring, Steven M., 2019. "Comparison of three short-term load forecast models in Southern California," Energy, Elsevier, vol. 189(C).
    13. Smyl, Slawek & Hua, N. Grace, 2019. "Machine learning methods for GEFCom2017 probabilistic load forecasting," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1424-1431.
    14. Barzin,Samira & Avner,Paolo & Maruyama Rentschler,Jun Erik & O’Clery,Neave, 2022. "Where Are All the Jobs ? A Machine Learning Approach for High Resolution Urban Employment Prediction inDeveloping Countries," Policy Research Working Paper Series 9979, The World Bank.
    15. You, Heyuan & Zhang, Xiaoling, 2017. "Sustainable livelihoods and rural sustainability in China: Ecologically secure, economically efficient or socially equitable?," Resources, Conservation & Recycling, Elsevier, vol. 120(C), pages 1-13.
    16. Eike Emrich & Christian Pierdzioch, 2016. "Volunteering, Match Quality, and Internet Use," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 136(2), pages 199-226.
    17. Ke Wang & Yongsheng Yang & Genserik Reniers & Quanyi Huang, 2021. "A study into the spatiotemporal distribution of typhoon storm surge disasters in China," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 108(1), pages 1237-1256, August.
    18. Kusiak, Andrew & Zheng, Haiyang & Song, Zhe, 2009. "On-line monitoring of power curves," Renewable Energy, Elsevier, vol. 34(6), pages 1487-1493.
    19. Zhu, Siying & Zhu, Feng, 2019. "Cycling comfort evaluation with instrumented probe bicycle," Transportation Research Part A: Policy and Practice, Elsevier, vol. 129(C), pages 217-231.
    20. Catherine Ikae & Jacques Savoy, 2022. "Gender identification on Twitter," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(1), pages 58-69, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:6:p:2918-:d:515909. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.