Score: 0

Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment

Published: July 11, 2025 | arXiv ID: 2507.08367v1

By: Yuki Yoshihara , Linjing Jiang , Nihan Karatas and more

Potential Business Impact:

Helps cars understand driving situations like people.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

This study investigates the potential of a multimodal large language model (LLM), specifically ChatGPT-4o, to perform human-like interpretations of traffic scenes using static dashcam images. Herein, we focus on three judgment tasks relevant to elderly driver assessments: evaluating traffic density, assessing intersection visibility, and recognizing stop signs recognition. These tasks require contextual reasoning rather than simple object detection. Using zero-shot, few-shot, and multi-shot prompting strategies, we evaluated the performance of the model with human annotations serving as the reference standard. Evaluation metrics included precision, recall, and F1-score. Results indicate that prompt design considerably affects performance, with recall for intersection visibility increasing from 21.7% (zero-shot) to 57.0% (multi-shot). For traffic density, agreement increased from 53.5% to 67.6%. In stop-sign detection, the model demonstrated high precision (up to 86.3%) but a lower recall (approximately 76.7%), indicating a conservative response tendency. Output stability analysis revealed that humans and the model faced difficulties interpreting structurally ambiguous scenes. However, the model's explanatory texts corresponded with its predictions, enhancing interpretability. These findings suggest that, with well-designed prompts, LLMs hold promise as supportive tools for scene-level driving risk assessments. Future studies should explore scalability using larger datasets, diverse annotators, and next-generation model architectures for elderly driver assessments.

Large Language Models for Pedestrian Safety: An Application to Predicting Driver Yielding Behavior at Unsignalized Intersections

Computation and Language

Helps cars predict if drivers will stop for people.

24 Sep 2025 1

90%

Investigating Traffic Accident Detection Using Multimodal Large Language Models

CV and Pattern Recognition

Finds car crashes from camera pictures.

23 Sep 2025 0

90%

Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends

CV and Pattern Recognition

Makes cars see and understand everything around them.

21 Apr 2025 0

View PDF Login to Bookmark

Page Count

8 pages

Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment

Helps cars understand driving situations like people.

Technical Abstract

Large Language Models for Pedestrian Safety: An Application to Predicting Driver Yielding Behavior at Unsignalized Intersections

Investigating Traffic Accident Detection Using Multimodal Large Language Models

Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends