IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v16y2019i3p334-d200807.html
   My bibliography  Save this article

Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

Author

Listed:
  • Chen Wang

    (Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 210096, China
    Intelligent Transportation Research Center, Southeast University, Nanjing 210096, China)

  • Lin Liu

    (Jiangsu Intelligent Transportation Systems Co., Ltd., Nanjing 210096, China)

  • Chengcheng Xu

    (Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 210096, China)

  • Weitao Lv

    (Jiangsu Intelligent Transportation Systems Co., Ltd., Nanjing 210096, China)

Abstract

The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers’ two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013–2014, 2014–2015, 2015–2016, and 2016–2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016–2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1–2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively.

Suggested Citation

  • Chen Wang & Lin Liu & Chengcheng Xu & Weitao Lv, 2019. "Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework," IJERPH, MDPI, vol. 16(3), pages 1-18, January.
  • Handle: RePEc:gam:jijerp:v:16:y:2019:i:3:p:334-:d:200807
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/16/3/334/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/16/3/334/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Wang, Shuaian & Chen, Weijie & Zheng, Yuan, 2016. "Willingness to board: A novel concept for modeling queuing up passengersAuthor-Name: Liu, Zhiyuan," Transportation Research Part B: Methodological, Elsevier, vol. 90(C), pages 70-82.
    2. Liu, Zhiyuan & Chen, Xinyuan & Meng, Qiang & Kim, Inhi, 2018. "Remote park-and-ride network equilibrium model and its applications," Transportation Research Part B: Methodological, Elsevier, vol. 117(PA), pages 37-62.
    3. Feng Chen & Suren Chen & Xiaoxiang Ma, 2016. "Crash Frequency Modeling Using Real-Time Environmental and Traffic Data and Unbalanced Panel Data Models," IJERPH, MDPI, vol. 13(6), pages 1-16, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shuaiming Chen & Haipeng Shao & Ximing Ji, 2021. "Insights into Factors Affecting Traffic Accident Severity of Novice and Experienced Drivers: A Machine Learning Approach," IJERPH, MDPI, vol. 18(23), pages 1-20, December.
    2. Muhammad Zahid & Yangzhou Chen & Sikandar Khan & Arshad Jamal & Muhammad Ijaz & Tufail Ahmed, 2020. "Predicting Risky and Aggressive Driving Behavior among Taxi Drivers: Do Spatio-Temporal Attributes Matter?," IJERPH, MDPI, vol. 17(11), pages 1-21, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. An, Qinhe & Fu, Xiao & Huang, Di & Cheng, Qixiu & Liu, Zhiyuan, 2020. "Analysis of adding-runs strategy for peak-hour regular bus services," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 143(C).
    2. Xinyuan Chen & Ruyang Yin & Qinhe An & Yuan Zhang, 2021. "Modeling a Distance-Based Preferential Fare Scheme for Park-and-Ride Services in a Multimodal Transport Network," Sustainability, MDPI, vol. 13(5), pages 1-14, March.
    3. Yulong Bao & Yongle Li & Jiajie Ding, 2016. "A Case Study of Dynamic Response Analysis and Safety Assessment for a Suspended Monorail System," IJERPH, MDPI, vol. 13(11), pages 1-17, November.
    4. Sheng Dong & Afaq Khattak & Irfan Ullah & Jibiao Zhou & Arshad Hussain, 2022. "Predicting and Analyzing Road Traffic Injury Severity Using Boosting-Based Ensemble Learning Models with SHAPley Additive exPlanations," IJERPH, MDPI, vol. 19(5), pages 1-23, March.
    5. Songyot Kitthamkesorn & Anthony Chen & Sathaporn Opasanon & Suwicha Jaita, 2021. "A P-Hub Location Problem for Determining Park-and-Ride Facility Locations with the Weibit-Based Choice Model," Sustainability, MDPI, vol. 13(14), pages 1-16, July.
    6. Chen Xu & Decun Dong & Dongxiu Ou & Changxi Ma, 2019. "Time-of-Day Control Double-Order Optimization of Traffic Safety and Data-Driven Intersections," IJERPH, MDPI, vol. 16(5), pages 1-18, March.
    7. Wang, Yu & Liu, Haoxiang & Fan, Yinchao & Ding, Jianxun & Long, Jiancheng, 2022. "Large-scale multimodal transportation network models and algorithms-Part II: Network capacity and network design problem," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 167(C).
    8. Wang, Guangchao & Chen, Anthony & Kitthamkesorn, Songyot & Ryu, Seungkyu & Qi, Hang & Song, Ziqi & Song, Jianguo, 2020. "A multi-modal network equilibrium model with captive mode choice and path size logit route choice," Transportation Research Part A: Policy and Practice, Elsevier, vol. 136(C), pages 293-317.
    9. Wang, Shuaian & Qu, Xiaobo, 2017. "Station choice for Australian commuter rail lines: Equilibrium and optimal fare design," European Journal of Operational Research, Elsevier, vol. 258(1), pages 144-154.
    10. Ming Lv & Xiaojun Shao & Chimou Li & Feng Chen, 2022. "Driving Performance Evaluation of Shuttle Buses: A Case Study of Hong Kong–Zhuhai–Macau Bridge," IJERPH, MDPI, vol. 19(3), pages 1-13, January.
    11. Wang, Shuaian & Wang, Xinchang, 2016. "A polynomial-time algorithm for sailing speed optimization with containership resource sharing," Transportation Research Part B: Methodological, Elsevier, vol. 93(PA), pages 394-405.
    12. Jing Gao & Sen Li, 2023. "Regulating For-Hire Autonomous Vehicles for An Equitable Multimodal Transportation Network," Papers 2301.05798, arXiv.org, revised Oct 2023.
    13. Xiaojun Shao & Xiaoxiang Ma & Feng Chen & Mingtao Song & Xiaodong Pan & Kesi You, 2020. "A Random Parameters Ordered Probit Analysis of Injury Severity in Truck Involved Rear-End Collisions," IJERPH, MDPI, vol. 17(2), pages 1-18, January.
    14. Feng Chen & Xiaoxiang Ma & Suren Chen & Lin Yang, 2016. "Crash Frequency Analysis Using Hurdle Models with Random Effects Considering Short-Term Panel Data," IJERPH, MDPI, vol. 13(11), pages 1-11, October.
    15. Huang, Di & Chen, Xinyuan & Liu, Zhiyuan & Lyu, Cheng & Wang, Shuaian & Chen, Xuewu, 2020. "A static bike repositioning model in a hub-and-spoke network framework," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 141(C).
    16. Huang, Di & Liu, Zhiyuan & Liu, Pan & Chen, Jun, 2016. "Optimal transit fare and service frequency of a nonlinear origin-destination based fare structure," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 96(C), pages 1-19.
    17. Qiang Zeng & Wei Hao & Jaeyoung Lee & Feng Chen, 2020. "Investigating the Impacts of Real-Time Weather Conditions on Freeway Crash Severity: A Bayesian Spatial Analysis," IJERPH, MDPI, vol. 17(8), pages 1-15, April.
    18. Xinyuan Chen & Yiran Wang & Yuan Zhang, 2021. "A Trial-and-Error Toll Design Method for Traffic Congestion Mitigation on Large River-Crossing Channels in a Megacity," Sustainability, MDPI, vol. 13(5), pages 1-13, March.
    19. Shen, Liang & Shao, Hu & Wu, Ting & Fainman, Emily Zhu & Lam, William H.K., 2020. "Finding the reliable shortest path with correlated link travel times in signalized traffic networks under uncertainty," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).
    20. Shucheng Yu & Shuaian Wang & Lu Zhen, 2017. "Quay crane scheduling problem with considering tidal impact and fuel consumption," Flexible Services and Manufacturing Journal, Springer, vol. 29(3), pages 345-368, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:16:y:2019:i:3:p:334-:d:200807. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.