IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v12y2020i12p4882-d371811.html
   My bibliography  Save this article

Analysis of Run-Off-Road Accidents by Association Rule Mining and Geographic Information System Techniques on Imbalanced Datasets

Author

Listed:
  • Feifeng Jiang

    (Department of Architecture and Civil Engineering, City University of Hong Kong, Hong Kong, China)

  • Kwok Kit Richard Yuen

    (Department of Architecture and Civil Engineering, City University of Hong Kong, Hong Kong, China)

  • Eric Wai Ming Lee

    (Department of Architecture and Civil Engineering, City University of Hong Kong, Hong Kong, China)

  • Jun Ma

    (Department of Research and Development, Big Bay Innovation Research and Development Limited, Hong Kong, China)

Abstract

Run-off-road (ROR) accidents cause a large proportion of fatalities on roads. Exploring key factors is an effective method to reduce fatalities and improve safety sustainability. However, some limitations exist in current studies: (1) Datasets of ROR accidents have imbalance problems, in which the samples of fatal accidents (FA) are much less than non-fatal accidents (NFA). Data mining methods on such imbalanced datasets make the results biased. (2) Few studies conducted spatial analysis of ROR accidents in visualization. Therefore, this study proposes an association rule mining (ARM)-based framework to analyze ROR accidents on imbalanced datasets. A novel method is proposed to address the imbalance problem and ARM is applied to analyze accident severity. Geographic information system (GIS) is adopted for spatial analysis of ROR accidents. The proposed framework is applied to ROR accidents in Victoria, Australia. Six FA factors and seven NFA factors are identified from two-item rules. The results of three-item rules indicate factors acting interactively increase the likelihood of FA or NFA. Hot spots of ROR accidents are presented by GIS maps. Effective measures are accordingly proposed to improve road safety. Compared with traditional data-balancing methods, the proposed framework has been validated to provide more robust and reliable results on imbalanced datasets.

Suggested Citation

  • Feifeng Jiang & Kwok Kit Richard Yuen & Eric Wai Ming Lee & Jun Ma, 2020. "Analysis of Run-Off-Road Accidents by Association Rule Mining and Geographic Information System Techniques on Imbalanced Datasets," Sustainability, MDPI, vol. 12(12), pages 1-32, June.
  • Handle: RePEc:gam:jsusta:v:12:y:2020:i:12:p:4882-:d:371811
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/12/12/4882/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/12/12/4882/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Denis Macharia & Erneus Kaijage & Leif Kindberg & Grace Koech & Lilian Ndungu & Anastasia Wahome & Robinson Mugo, 2020. "Mapping Climate Vulnerability of River Basin Communities in Tanzania to Inform Resilience Interventions," Sustainability, MDPI, vol. 12(10), pages 1-24, May.
    2. Rong-Chang Jou & Tzu-Ying Chen, 2015. "External Costs to Parties Involved in Highway Traffic Accidents: The Perspective of Highway Users," Sustainability, MDPI, vol. 7(6), pages 1-23, June.
    3. Jungeun Park & Yongwoon Cha & Hamad Al Jassmi & Sangwon Han & Chang-taek Hyun, 2020. "Identification of Defect Generation Rules among Defects in Construction Projects Using Association Rule Mining," Sustainability, MDPI, vol. 12(9), pages 1-13, May.
    4. Jianyu Wang & Huapu Lu & Zhiyuan Sun & Tianshi Wang & Katrina Wang, 2020. "Investigating the Impact of Various Risk Factors on Victims of Traffic Accidents," Sustainability, MDPI, vol. 12(9), pages 1-12, May.
    5. Natalia Casado-Sanz & Begoña Guirao & Maria Attard, 2020. "Analysis of the Risk Factors Affecting the Severity of Traffic Accidents on Spanish Crosstown Roads: The Driver’s Perspective," Sustainability, MDPI, vol. 12(6), pages 1-26, March.
    6. Sangdeok Lee & Yongwoon Cha & Sangwon Han & Changtaek Hyun, 2019. "Application of Association Rule Mining and Social Network Analysis for Understanding Causality of Construction Defects," Sustainability, MDPI, vol. 11(3), pages 1-14, January.
    7. Ma, Jun & Cheng, Jack C.P., 2016. "Estimation of the building energy use intensity in the urban scale by integrating GIS and big data technology," Applied Energy, Elsevier, vol. 183(C), pages 182-192.
    8. Igor Dirnbach & Tibor Kubjatko & Eduard Kolla & Ján Ondruš & Željko Šarić, 2020. "Methodology Designed to Evaluate Accidents at Intersection Crossings with Respect to Forensic Purposes and Transport Sustainability," Sustainability, MDPI, vol. 12(5), pages 1-22, March.
    9. Sonam Wangyel Wang & Belay Manjur Gebru & Munkhnasan Lamchin & Rijan Bhakta Kayastha & Woo-Kyun Lee, 2020. "Land Use and Land Cover Change Detection and Prediction in the Kathmandu District of Nepal Using Remote Sensing and GIS," Sustainability, MDPI, vol. 12(9), pages 1-18, May.
    10. Laura Eboli & Carmen Forciniti, 2020. "The Severity of Traffic Crashes in Italy: An Explorative Analysis among Different Driving Circumstances," Sustainability, MDPI, vol. 12(3), pages 1-19, January.
    11. Byeongjoon Noh & Juntae Son & Hansaem Park & Seongju Chang, 2017. "In-Depth Analysis of Energy Efficiency Related Factors in Commercial Buildings Using Data Cube and Association Rule Mining," Sustainability, MDPI, vol. 9(11), pages 1-20, November.
    12. Tosporn Arreeras & Mikiharu Arimura & Takumi Asada & Saharat Arreeras, 2019. "Association Rule Mining Tourist-Attractive Destinations for the Sustainable Development of a Large Tourism Area in Hokkaido Using Wi-Fi Tracking Data," Sustainability, MDPI, vol. 11(14), pages 1-17, July.
    13. Jungyeol Hong & Reuben Tamakloe & Dongjoo Park, 2019. "A Comprehensive Analysis of Multi-Vehicle Crashes on Expressways: A Double Hurdle Approach," Sustainability, MDPI, vol. 11(10), pages 1-22, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jaewoong Yun, 2023. "Strategies for Improving the Sustainability of Fare-Free Policy for the Elderly through Preferences by Travel Modes," Sustainability, MDPI, vol. 15(20), pages 1-14, October.
    2. Vorapot Sapsirisavat & Wiriya Mahikul, 2021. "Drinking and Night-Time Driving May Increase the Risk of Severe Health Outcomes: A 5-Year Retrospective Study of Traffic Injuries among International Travelers at a University Hospital Emergency Cente," IJERPH, MDPI, vol. 18(18), pages 1-9, September.
    3. Jeeyoung Lim & Joseph J. Kim & Sunkuk Kim, 2021. "A Holistic Review of Building Energy Efficiency and Reduction Based on Big Data," Sustainability, MDPI, vol. 13(4), pages 1-18, February.
    4. Vicente Joaquín Delgado-Fernández & María del Carmen Rey-Merchán & Antonio López-Arquillos & Sang D. Choi, 2022. "Occupational Traffic Accidents among Teachers in Spain," IJERPH, MDPI, vol. 19(9), pages 1-9, April.
    5. Xiuguang Song & Rendong Pi & Yu Zhang & Jianqing Wu & Yuhuan Dong & Han Zhang & Xinyuan Zhu, 2021. "Determinants and Prediction of Injury Severities in Multi-Vehicle-Involved Crashes," IJERPH, MDPI, vol. 18(10), pages 1-16, May.
    6. Dongsu Kim & Jongman Lee & Sunglok Do & Pedro J. Mago & Kwang Ho Lee & Heejin Cho, 2022. "Energy Modeling and Model Predictive Control for HVAC in Buildings: A Review of Current Research Trends," Energies, MDPI, vol. 15(19), pages 1-30, October.
    7. Jianyu Wang & Huapu Lu & Zhiyuan Sun & Tianshi Wang, 2020. "Exploring Factors Influencing Injury Severity of Vehicle At-Fault Accidents: A Comparative Analysis of Passenger and Freight Vehicles," IJERPH, MDPI, vol. 17(4), pages 1-12, February.
    8. Ma, Weiwu & Fang, Song & Liu, Gang & Zhou, Ruoyu, 2017. "Modeling of district load forecasting for distributed energy system," Applied Energy, Elsevier, vol. 204(C), pages 181-205.
    9. Zhenzhen Xu & Chunfu Shao & Shengyou Wang & Chunjiao Dong, 2020. "Analysis and Prediction Model of Resident Travel Satisfaction," Sustainability, MDPI, vol. 12(18), pages 1-15, September.
    10. Chunying Ning & Rajan Subedi & Lu Hao, 2023. "Land Use/Cover Change, Fragmentation, and Driving Factors in Nepal in the Last 25 Years," Sustainability, MDPI, vol. 15(8), pages 1-19, April.
    11. Rosenfelder, Markus & Wussow, Moritz & Gust, Gunther & Cremades, Roger & Neumann, Dirk, 2021. "Predicting residential electricity consumption using aerial and street view images," Applied Energy, Elsevier, vol. 301(C).
    12. Tibor Sipos & Anteneh Afework Mekonnen & Zsombor Szabó, 2021. "Spatial Econometric Analysis of Road Traffic Crashes," Sustainability, MDPI, vol. 13(5), pages 1-16, February.
    13. Md Golam Azam & Md Mujibor Rahman, 2022. "Assessing spatial vulnerability of Bangladesh to climate change and extremes: a geographic information system approach," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 27(6), pages 1-35, August.
    14. Gabriel Brătucu & Anca Madar & Dana Boşcor & Codruţa Adina Băltescu & Nicoleta Andreea Neacşu, 2016. "Road Safety Education in the Context of the Sustainable Development of Society: The Romanian Case," Sustainability, MDPI, vol. 8(3), pages 1-13, March.
    15. Jason Jihoon Ree & Cheolhyun Jeong & Hyunseok Park & Kwangsoo Kim, 2019. "Context–Problem Network and Quantitative Method of Patent Analysis: A Case Study of Wireless Energy Transmission Technology," Sustainability, MDPI, vol. 11(5), pages 1-18, March.
    16. Wenliang Li, 2020. "Quantifying the Building Energy Dynamics of Manhattan, New York City, Using an Urban Building Energy Model and Localized Weather Data," Energies, MDPI, vol. 13(12), pages 1-22, June.
    17. Zhang, Yan & Teoh, Bak Koon & Wu, Maozhi & Chen, Jiayu & Zhang, Limao, 2023. "Data-driven estimation of building energy consumption and GHG emissions using explainable artificial intelligence," Energy, Elsevier, vol. 262(PA).
    18. Sujarwo & Aditya Nugraha Putra & Raden Arief Setyawan & Heitor Mancini Teixeira & Uma Khumairoh, 2022. "Forecasting Rice Status for a Food Crisis Early Warning System Based on Satellite Imagery and Cellular Automata in Malang, Indonesia," Sustainability, MDPI, vol. 14(15), pages 1-14, July.
    19. Li, Guannan & Hu, Yunpeng & Chen, Huanxin & Li, Haorong & Hu, Min & Guo, Yabin & Liu, Jiangyan & Sun, Shaobo & Sun, Miao, 2017. "Data partitioning and association mining for identifying VRF energy consumption patterns under various part loads and refrigerant charge conditions," Applied Energy, Elsevier, vol. 185(P1), pages 846-861.
    20. Li, Xinyi & Yao, Runming, 2020. "A machine-learning-based approach to predict residential annual space heating and cooling loads considering occupant behaviour," Energy, Elsevier, vol. 212(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:12:y:2020:i:12:p:4882-:d:371811. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.