Railroad Accident Analysis Using Extreme Gradient Boosting
dc.contributor.author | Bridgelall, Raj | |
dc.contributor.author | Tolliver, Denver D. | |
dc.contributor.organization | Upper Great Plains Transportation Institute | |
dc.date.accessioned | 2021-08-04T21:22:37Z | |
dc.date.available | 2021-08-04T21:22:37Z | |
dc.date.issued | 2021 | |
dc.description | Raj Bridgelall is the program director for the Upper Great Plains Transportation Institute (UGPTI) Center for Surface Mobility Applications & Real-time Simulation environments (SMARTSeSM). | en_US |
dc.description.abstract | Railroads are critical to the economic health of a nation. Unfortunately, railroads lose hundreds of millions of dollars from accidents each year. Trends reveal that derailments consistently account for more than 70% of the U.S. railroad industry’s average annual accident cost. Hence, knowledge of explanatory factors that distinguish derailments from other accident types can inform more cost-effective and impactful railroad risk management strategies. Five feature scoring methods, including ANOVA and Gini, agreed that the top four explanatory factors in accident type prediction were track class, type of movement authority, excess speed, and territory signalization. Among 11 different types of machine learning algorithms, the extreme gradient boosting method was most effective at predicting the accident type with an area under the receiver operating curve (AUC) metric of 89%. Principle component analysis revealed that relative to other accident types, derailments were more strongly associated with lower track classes, non-signalized territories, and movement authorizations within restricted limits. On average, derailments occurred at 16 kph below the speed limit for the track class whereas other accident types occurred at 32 kph below the speed limit. Railroads can use the integrated data preparation, machine learning, and feature ranking framework presented to gain additional insights for managing risk, based on their unique operating environments. | en_US |
dc.description.sponsorship | North Dakota State University | en_US |
dc.description.uri | https://www.ugpti.org/about/staff/viewbio.php?id=79 | |
dc.identifier.citation | Bridgelall, Raj and Denver Tolliver. "Railroad Accident Analysis Using Extreme Gradient Boosting." Accident Analysis and Prevention, DOI:10.1016/j.aap.2021.106126, 156(106126), June 2021. | en_US |
dc.identifier.doi | 10.1016/j.aap.2021.106126 | |
dc.identifier.orcid | 0000-0003-3743-6652 | |
dc.identifier.orcid | 0000-0002-8522-9394 | |
dc.identifier.uri | https://hdl.handle.net/10365/31986 | |
dc.language.iso | en_US | en_US |
dc.rights | In copyright. Permission to make this version available has been granted by the author and publisher. | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | Data cleaning. | en_US |
dc.subject | Feature engineering. | en_US |
dc.subject | Financial loss. | en_US |
dc.subject | Machine learning. | en_US |
dc.subject | Principle component analysis. | en_US |
dc.subject | Risk management. | en_US |
dc.title | Railroad Accident Analysis Using Extreme Gradient Boosting | en_US |
dc.type | Article | en_US |
dc.type | Preprint | en_US |
ndsu.college | College of Business | |
ndsu.department | Transportation and Logistics |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Bridgelall-2021-06-extreme-gradient-boosting.pdf
- Size:
- 1.24 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 2 of 2
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.63 KB
- Format:
- Item-specific license agreed to upon submission
- Description:
No Thumbnail Available
- Name:
- Permission-Raj Bridgelall via Patrick Nichols-20210419-extreme-gradient-boosting.pdf
- Size:
- 290.03 KB
- Format:
- Adobe Portable Document Format
- Description:
- communication with Patrick Nichols