Show simple item record

dc.contributor.authorDhingra, Neeraj
dc.contributor.authorBridgelall, Raj
dc.contributor.authorLu, Pan
dc.contributor.authorSzmerekovsky, Joseph
dc.contributor.authorBhardwaj, Bhavana
dc.description.abstractThe reported financial losses from railroad accidents since 2009 have been more than US$4.11 billion dollars. This considerable loss is a major concern for the industry, society, and the government. Therefore, identifying and ranking the factors that contribute to financial losses from railroad accidents would inform strategies to minimize them. To achieve that goal, this paper evaluates and compares the results of applying different non-parametric statistical and regression methods to 15 years of railroad Class I freight train accident data. The models compared are random forest, k-nearest neighbors, support vector machines, stochastic gradient boosting, extreme gradient boosting, and stepwise linear regression. The results indicate that these methods are all suitable for analyzing non-linear and heterogeneous railroad incident data. However, the extreme gradient boosting method provided the best performance. Therefore, the analysis used that model to identify and rank factors that contribute to financial losses, based on the gain percentage of the prediction accuracy. The number of derailed freight cars and the absence of territory signalization dominated as contributing factors in more than 57% and 20% of the accidents, respectively. Partial-dependence plots further explore the complex non-linear dependencies of each factor to better visualize and interpret the results.en_US
dc.rightsIn copyright. Permission to make this version available has been granted by the author and publisher.
dc.titleRanking Risk Factors in Financial Losses From Railroad Incidents: A Machine Learning Approachen_US
dc.typeArticleen_US
dc.typePreprinten_US
dc.descriptionRaj Bridgelall is the program director for the Upper Great Plains Transportation Institute (UGPTI) Center for Surface Mobility Applications & Real-time Simulation environments (SMARTSeSM).en_US
dc.date.accessioned2023-05-23T16:56:56Z
dc.date.available2023-05-23T16:56:56Z
dc.date.issued2023
dc.identifier.urihttps://hdl.handle.net/10365/33156
dc.subjectrailen_US
dc.subjectrail safetyen_US
dc.subjectmachine learningen_US
dc.subjecthuman factors in crashesen_US
dc.subjectsystem safetyen_US
dc.subjectdata scienceen_US
dc.subjectclass I railen_US
dc.subjectfreight trainen_US
dc.identifier.orcid0000-0001-9970-7185
dc.identifier.orcid0000-0003-3743-6652
dc.identifier.orcid0000-0002-1640-3598
dc.identifier.orcid0000-0002-3355-9340
dc.identifier.orcid0000-0002-4379-1565
dc.identifier.citationDhingra, N., Bridgelall, R., Lu, P., Szmerekovsky, J., & Bhardwaj, B. (2023). Ranking Risk Factors in Financial Losses From Railroad Incidents: A Machine Learning Approach. Transportation Research Record, 2677(2), 299–309. https://doi.org/10.1177/03611981221133085en_US
dc.description.sponsorshipThe authors express their deep gratitude to the following funding agencies’ support: North Dakota State University and the Mountain-Plains Consortium (MPC), a University Transportation Center funded by the U.S. Department of Transportation under grant number DTRT13-G-UTC38.en_US
dc.description.urihttps://www.ugpti.org/about/staff/viewbio.php?id=79
dc.language.isoen_USen_US
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.contributor.organizationUpper Great Plains Transportation Institute
ndsu.collegeCollege of Business
ndsu.departmentTransportation and Logistics
dc.identifier.doihttps://doi.org/10.1177/03611981221133085


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record