Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems

Published: June 17, 2025 | arXiv ID: 2506.14096v2

By: Sanjeda Akter, Ibne Farabi Shihab, Anuj Sharma

Potential Business Impact:

Helps cars understand roads and traffic better.

Business Areas:
Image Recognition, Data and Analytics, Software

The integration of Large Language Models (LLMs) with computer vision is profoundly transforming perception tasks like image segmentation. For intelligent transportation systems (ITS), where accurate scene understanding is critical for safety and efficiency, this new paradigm offers unprecedented capabilities. This survey systematically reviews the emerging field of LLM-augmented image segmentation, focusing on its applications, challenges, and future directions within ITS. We provide a taxonomy of current approaches based on their prompting mechanisms and core architectures, and we highlight how these innovations can enhance road scene understanding for autonomous driving, traffic monitoring, and infrastructure maintenance. Finally, we identify key challenges, including real-time performance and safety-critical reliability, and outline a perspective centered on explainable, human-centric AI as a prerequisite for the successful deployment of this technology in next-generation transportation systems.

Country of Origin
🇺🇸 United States

Page Count
24 pages

Category
Computer Science:
Computer Vision and Pattern Recognition