A probabilistic Bayesian inference model to investigate injury severity in automobile crashes
作者:
Highlights:
• Understanding and mitigating factors related to injury severity is challenging problem.
• This study identifies the probabilistic interrelations among the injury related factors.
• The proposed model is based on a Bayesian Belief Network-driven probabilistic model.
• The results are incorporated in a DSS via a web-based probabilistic inference simulator.
摘要
Big data analytics examines millions, if not billions of records, to unmask hidden patterns, provide actionable insights and interpretable results for various domains. One area that has great potential to leverage the value of big data and analytics is the critical analysis of traffic accidents. Investigation results help in providing an in-depth understanding of the risks and provide measures to potentially prevent these risk factors hence enhancing the well-being of individuals who may experience such accidents. This study explains existing models and proposes a data science methodology in a field where probabilistic modeling makes much sense for faster, better decision-making. The main objective of this data analytics study is to identify the high-risk factors with their apparent significance to influence the probability of injury severity on automobile crashes using a geographically representative car crash dataset. To obtain reliable, accurate, and intuitive results, a multi-step probabilistic inference model based on Bayesian Belief Network— highly-acclaimed machine learning methodology—is proposed. The underlying inference model provides researchers with a causally accurate way to explore the domain (with the subject matter expert inputs) while disengaging issues related to statistical correlations and causal effects. In this study, we also used the data to create a web-based probabilistic inference simulator, a Bayesian inference decision support tool, which will be a publicly available/accessible tool, to help decision-makers better understand and to conduct what-if analysis on variable interdependencies.
论文关键词:Data science,Bayesian inference,Injury severity,Decision making,Cross-entropy loss function, Bayesian missing data handling
论文评审过程:Received 15 July 2020, Revised 12 March 2021, Accepted 16 March 2021, Available online 20 March 2021, Version of Record 24 September 2021.
论文官网地址:https://doi.org/10.1016/j.dss.2021.113557