Author
Listed:
- Yang Zhao
(Johns Hopkins University
Johns Hopkins University)
- Pu Wang
(Johns Hopkins University
Johns Hopkins University)
- Yibo Zhao
(Johns Hopkins University)
- Hongru Du
(Johns Hopkins University
Johns Hopkins University
University of Virginia)
- Hao Frank Yang
(Johns Hopkins University
Johns Hopkins University
Johns Hopkins University)
Abstract
Predicting expected traffic crashes and designing targeted interventions are highly challenging due to the inherent complexity of crash data and persistent concerns over the prediction trustworthiness. We introduce SafeTraffic Copilot that adapts Large Language Models (LLMs) to perform expected crash prediction as a text-reasoning task, then attribute critical features for targeted safety interventions. Within the Copilot, SafeTraffic LLM is customized then fine-tuned on the textualized SafeTraffic Event dataset, which consists of 66,205 real-world crash cases with 14.5 million words from five U.S. states. Across multiple prediction tasks including crash type, severity, and number of injuries, SafeTraffic LLM demonstrates a 33.3% to 45.8% improvement in average F1-score over existing works. To interpret these results and inform safety interventions, we introduce SafeTraffic Attribution, a sentence-level feature-attribution framework enabling conditional “what-if" risk analysis. Findings reveal that alcohol-impaired driving is the leading factor for severe crashes, with impairment-related and aggressive behaviors contributing nearly three times more risk than other behaviors. Furthermore, SafeTraffic Attribution identifies critical features during fine-tuning, guiding crash data collection strategies for continual improvement. SafeTraffic Copilot enables prediction and reasoning of conditional crash risks through foundation models, thereby supporting traffic safety improvements and offering clear advantages in generalization, adaptation, and trustworthiness.
Suggested Citation
Yang Zhao & Pu Wang & Yibo Zhao & Hongru Du & Hao Frank Yang, 2025.
"SafeTraffic Copilot: adapting large language models for trustworthy traffic safety assessments and decision interventions,"
Nature Communications, Nature, vol. 16(1), pages 1-17, December.
Handle:
RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64574-w
DOI: 10.1038/s41467-025-64574-w
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64574-w. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.