XGBoost-SHAP and Unobserved Heterogeneity Modelling of Temporal Multivehicle Truck-Involved Crash Severity Patterns

Truck-Involved Crashes Injury Severities Random Parameters Machine Learning eXtreme Gradient Boosting SHAP.

Authors

  • Wimon Laphrom Institute of Research and Development, Suranaree University of Technology, Nakhon Ratchasima 30000,, Thailand
  • Chamroeun Se Institute of Research and Development, Suranaree University of Technology, Nakhon Ratchasima 30000,, Thailand
  • Thanapong Champahom Department of Management, Faculty of Business Administration, Rajamangala University of Technology Isan, Nakhon Ratchasima 30000,, Thailand
  • Sajjakaj Jomnonkwao School of Transportation Engineering, Institute of Engineering, Suranaree University of Technology, Nakhon Ratchasima 30000,, Thailand
  • Warit Wipulanusatd Department of Civil Engineering, Thammasat School of Engineering, Thammasat University, Pathumthani 12120,, Thailand
  • Thaned Satiennam Department of Civil Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen 40002,, Thailand
  • Vatanavongs Ratanavaraha
    vatanavongs@g.sut.ac.th
    School of Transportation Engineering, Institute of Engineering, Suranaree University of Technology, Nakhon Ratchasima 30000,, Thailand https://orcid.org/0000-0002-4620-5058

Downloads

This paper aims to address the critical issue of multivehicle truck crashes in developing regions, with a focus on Thailand, by analyzing the factors that influence injury severity and comparing the effectiveness of predictive models. Utilizing advanced random parameters and the XGBoost machine learning algorithm, we conducted a comprehensive analysis of injury severity factors in multivehicle truck-involved accidents, contrasting weekdays and weekends. Our findings reveal that the XGBoost model significantly outperforms the heterogeneous logit model in predicting crash severity outcomes, demonstrating superior accuracy, sensitivity, specificity, precision, F1 score, and area under the curve (AUC) in both model training and testing phases. Key risk factors identified include motorcycle involvement, head-on collisions, and crashes occurring during late night/early morning hours, with environmental elements like road lane numbers and weekend hours also playing a significant role. The study introduces XGBoost as a novel and improved method for truck safety analysis, capable of capturing the complex interactions within multivehicle crash data and offering actionable insights for targeted interventions to reduce crash severity. By highlighting specific risk factors and the effectiveness of XGBoost, this research contributes to the development of data-driven strategies for enhancing truck safety in developing countries.

 

Doi: 10.28991/CEJ-2024-010-06-011

Full Text: PDF