Kevin Lee’s research while affiliated with Western Michigan University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (2)


Fig. 2. The Study Methodology.
Fig. 3. Word Cloud of Collision and Near-Miss Events.
Fig. 8. Word Clouds of the Discovered Topics from Near-Miss and Event Descriptions.
Fig. 9. It is important not only to identify and interpret topics from near-miss and collision events but also to understand how
Automatic Topics Extraction from Crowdsourced Cyclists Near-Miss and Collision Reports Using Text Mining and Artificial Neural Networks
  • Article
  • Full-text available

October 2021

·

182 Reads

·

9 Citations

International Journal of Transportation Science and Technology

·

·

Kevin Lee

·

[...]

·

Trisalyn Nelson

Cycling is an eco-friendly and sustainable mode of transportation. Despite its benefits, the cyclists’ risk of collision is still high when interacting with other road users. This study analyzed self-reported near-miss and collision event descriptions for the United States provided by the crowdsourcing platform, BikeMaps.org. Innovative and efficient analytic methods are needed to generate useful information from unstructured textual data sources in the transportation domain. In this study, explorative text mining, topic modeling, and machine learning are utilized to gain insights from the unstructured textual descriptions of crowdsourced near-miss and collision events. The approaches are used to unveil prevalent words and word associations for near-miss and collision events. Structural Topic Modeling (STM) is deployed to autogenerate latent themes or topics from the event descriptions. The generated topic proportions are used as input in Artificial Neural Networks (ANN) to estimate the cyclist’s propensity to a collision. It was found that cyclists had a higher propensity to a collision in topics that articulated vehicle encroachment to the bike lane, on-street parking close or into the bike lane resulting in dooring, and drivers’ violations at the crosswalk. The results and methodology used in this study can assist engineers, policymakers, and law enforcement officers to proactively reduce potential cyclist collisions, prioritizing areas where cyclist safety improvements are needed, and ultimately promoting bicycle ridership in our communities.

Download

Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology

December 2020

·

94 Reads

·

52 Citations

Accident Analysis & Prevention

The proliferation of digital textual archives in the transportation safety domain makes it imperative for the inventions of efficient ways of extracting information from the textual data sources. The present study aims at utilizing crash narratives complemented by crash metadata to discern the prevalence and co-occurrence of themes that contribute to crash incidents. Ten years (2009–2018) of Michigan traffic fatal crash narratives were used as a case study. The structural topic modeling (STM) and network topology analysis were used to generate and examine the prevalence and interaction of themes from the crash narratives that were mainly categorized into pre-crash events, crash locations and involved parties in the traffic crashes. The main advantage of the STM over the other topic modeling approaches is that it allows the researchers to discover themes from documents and estimate how the topic relates to the document metadata. Topics with the highest prevalence for the angle, head-on, rear-end, sideswipe and single motor vehicle crashes were crash at stop-sign, crossing the centerline, unable to stop, lane change maneuver and run-off-road crash, respectively. Eigenvector centrality measure in network topology showed that event-related topics were consistently central in articulating the crash occurrence. The centrality and association between topics varied across crash types. The efficacy of generated topics in classifying crashes by type was tested using a machine learning algorithm, Random Forest. The classification accuracy in the held-out sample ranged between 89.3 % for sideswipe crashes to 99.2 % for single motor vehicle crashes. High classification accuracy suggests that automation of crash typing and consistency checks can be accomplished effectively by using extracted latent themes from the crash narratives. Free download link: https://authors.elsevier.com/a/1cB8C_27lVu7d

Citations (2)


... Even if the railroad is not at fault, failures that result in crashes typically receive widespread publicity [14], giving railroads an unfair image of slowness among the ignorant public and frequently fueling demands for urgent reforms [15]. Junctions of transportation modes are unique and possibly dangerous, yet virtually inevitable in all regions of the world [16]. ...

Reference:

Cost Analysis and Prediction of Railroad Level Crossing Accidents for Indian Railways
Automatic Topics Extraction from Crowdsourced Cyclists Near-Miss and Collision Reports Using Text Mining and Artificial Neural Networks

International Journal of Transportation Science and Technology

... Comprehensive pedestrian crash data are vital for safety research, as crash narratives and diagrams provide the most detailed descriptions of crash scenes and pedestrian actions [4,5]. Nonetheless, extracting and effectively utilizing this information from traffic crash reports continues to pose a significant challenge [6,7]. Although numerous studies have utilized different methodologies to examine crash injury severity based on crash narratives [7][8][9], there remains a pressing need for an automated approach to assist researchers in extracting crash patterns from crash diagrams. ...

Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology
  • Citing Article
  • December 2020

Accident Analysis & Prevention