Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems
By: Sanjeda Akter, Ibne Farabi Shihab, Anuj Sharma
Potential Business Impact:
Helps cars understand roads and traffic better.
The integration of Large Language Models (LLMs) with computer vision is profoundly transforming perception tasks like image segmentation. For intelligent transportation systems (ITS), where accurate scene understanding is critical for safety and efficiency, this new paradigm offers unprecedented capabilities. This survey systematically reviews the emerging field of LLM-augmented image segmentation, focusing on its applications, challenges, and future directions within ITS. We provide a taxonomy of current approaches based on their prompting mechanisms and core architectures, and we highlight how these innovations can enhance road scene understanding for autonomous driving, traffic monitoring, and infrastructure maintenance. Finally, we identify key challenges, including real-time performance and safety-critical reliability, and outline a perspective centered on explainable, human-centric AI as a prerequisite for the successful deployment of this technology in next-generation transportation systems.
Similar Papers
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Artificial Intelligence
Helps cars understand traffic better for safer roads.
Semantic Edge-Cloud Communication for Real-Time Urban Traffic Surveillance with ViT and LLMs over Mobile Networks
Networking and Internet Architecture
Makes city traffic cameras send less data.
Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends
CV and Pattern Recognition
Makes cars see and understand everything around them.