JOURNAL ARTICLE

Evaluation and interpretation of driving risks: Automobile claim frequency modeling with telematics data

Yaqian GaoYifan HuangShengwang Meng

Year: 2022 Journal:   Statistical Analysis and Data Mining The ASA Data Science Journal Vol: 16 (2)Pages: 97-119   Publisher: Wiley

Abstract

Abstract With the development of vehicle telematics and data mining technology, usage‐based insurance (UBI) has aroused widespread interest from both academia and industry. The extensive driving behavior features make it possible to further understand the risks of insured vehicles, but pose challenges in the identification and interpretation of important ratemaking factors. This study, based on the telematics data of policyholders in China's mainland, analyzes insurance claim frequency of commercial trucks using both Poisson regression and several machine learning models, including regression tree, random forest, gradient boosting tree, XGBoost and neural network. After selecting the best model, we analyze feature importance, feature effects and the contribution of each feature to the prediction from an actuarial perspective. Our empirical study shows that XGBoost greatly outperforms the traditional models and detects some important risk factors, such as the average speed, the average mileage traveled per day, the fraction of night driving, the number of sudden brakes and the fraction of left/right turns at intersections. These features usually have a nonlinear effect on driving risk, and there are complex interactions between features. To further distinguish high−/low‐risk drivers, we run supervised clustering for risk segmentation according to drivers' driving habits. In summary, this study not only provide a more accurate prediction of driving risk, but also greatly satisfy the interpretability requirements of insurance regulators and risk management.

Keywords:
Computer science Telematics Random forest Interpretability Cluster analysis Poisson regression Decision tree Data mining Artificial intelligence Machine learning

Metrics

4
Cited By
0.73
FWCI (Field Weighted Citation Impact)
62
Refs
0.65
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Traffic and Road Safety
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality
Autonomous Vehicle Technology and Safety
Physical Sciences →  Engineering →  Automotive Engineering
Insurance, Mortality, Demography, Risk Management
Social Sciences →  Social Sciences →  Demography

Related Documents

JOURNAL ARTICLE

Claims frequency modeling using telematics car driving data

Guangyuan GaoShengwang MengMario V. Wüthrich

Journal:   Scandinavian Actuarial Journal Year: 2018 Vol: 2019 (2)Pages: 143-162
JOURNAL ARTICLE

Automobile insurance classification ratemaking based on telematics driving data

Yifan HuangShengwang Meng

Journal:   Decision Support Systems Year: 2019 Vol: 127 Pages: 113156-113156
© 2026 ScienceGate Book Chapters — All rights reserved.