Score: 0

Mapping User Trust in Vision Language Models: Research Landscape, Challenges, and Prospects

Published: May 8, 2025 | arXiv ID: 2505.05318v1

By: Agnese Chiatti , Sara Bernardini , Lara Shibelski Godoy Piccolo and more

Potential Business Impact:

Helps people know when to trust AI that sees and talks.

Business Areas:
Image Recognition Data and Analytics, Software

The rapid adoption of Vision Language Models (VLMs), pre-trained on large image-text and video-text datasets, calls for protecting and informing users about when to trust these systems. This survey reviews studies on trust dynamics in user-VLM interactions, through a multi-disciplinary taxonomy encompassing different cognitive science capabilities, collaboration modes, and agent behaviours. Literature insights and findings from a workshop with prospective VLM users inform preliminary requirements for future VLM trust studies.

Country of Origin
🇮🇹 Italy

Page Count
21 pages

Category
Computer Science:
CV and Pattern Recognition