Score: 3

Investigating the Development of Task-Oriented Communication in Vision-Language Models

Published: January 28, 2026 | arXiv ID: 2601.20641v1

By: Boaz Carmeli , Orr Paradise , Shafi Goldwasser and more

BigTech Affiliations: University of California, Berkeley

Potential Business Impact:

AI learns secret codes to work together.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We investigate whether \emph{LLM-based agents} can develop task-oriented communication protocols that differ from standard natural language in collaborative reasoning tasks. Our focus is on two core properties such task-oriented protocols may exhibit: Efficiency -- conveying task-relevant information more concisely than natural language, and Covertness -- becoming difficult for external observers to interpret, raising concerns about transparency and control. To investigate these aspects, we use a referential-game framework in which vision-language model (VLM) agents communicate, providing a controlled, measurable setting for evaluating language variants. Experiments show that VLMs can develop effective, task-adapted communication patterns. At the same time, they can develop covert protocols that are difficult for humans and external agents to interpret. We also observe spontaneous coordination between similar models without explicitly shared protocols. These findings highlight both the potential and the risks of task-oriented communication, and position referential games as a valuable testbed for future work in this area.

Country of Origin
🇺🇸 🇮🇱 🇨🇭 Israel, United States, Switzerland

Repos / Data Links

Page Count
42 pages

Category
Computer Science:
Artificial Intelligence