SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation

Published: March 19, 2025 | arXiv ID: 2503.15358v3

By: Thomas Pickard, Aline Villavicencio, Maggie Mi, and more

Potential Business Impact:

Helps computers understand idiomatic sayings (figurative phrases) when they appear alongside pictures.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Idiomatic expressions present a unique challenge in NLP, as their meanings are often not directly inferable from their constituent words. Despite recent advancements in Large Language Models (LLMs), idiomaticity remains a significant obstacle to robust semantic representation. We present datasets and tasks for SemEval-2025 Task 1: AdMIRe (Advancing Multimodal Idiomaticity Representation), which challenges the community to assess and improve models' ability to interpret idiomatic expressions in multimodal contexts and in multiple languages. Participants competed in two subtasks: ranking images based on their alignment with idiomatic or literal meanings, and predicting the next image in a sequence. The most effective methods achieved human-level performance by leveraging pretrained LLMs and vision-language models in mixture-of-experts settings, issuing multiple queries to smooth over the weaknesses in these models' representations of idiomaticity.
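The "multiple queries" strategy in the abstract can be illustrated with a minimal, hypothetical sketch: several noisy relevance queries for the same idiom are pooled with a Borda-style count to produce a more stable image ranking. The names `score_fn` and `aggregate_rankings`, and the Borda aggregation itself, are illustrative assumptions, not the participants' actual systems.

```python
import random
from collections import defaultdict
from typing import Callable, Dict, List


def aggregate_rankings(
    candidate_images: List[str],
    score_fn: Callable[[str, str], float],  # hypothetical scorer: (idiom, image) -> relevance
    idiom: str,
    n_queries: int = 5,
) -> List[str]:
    """Rank candidate images for an idiom by pooling several noisy queries.

    Each query yields one ranking; a Borda-style count smooths out
    inconsistencies in any single model response (a stand-in for the
    multi-query ensembling described in the abstract).
    """
    points: Dict[str, int] = defaultdict(int)
    for _ in range(n_queries):
        ranked = sorted(candidate_images, key=lambda img: score_fn(idiom, img), reverse=True)
        for position, img in enumerate(ranked):
            points[img] += len(ranked) - position  # higher rank earns more points
    return sorted(candidate_images, key=lambda img: points[img], reverse=True)


if __name__ == "__main__":
    # Placeholder scorer standing in for a vision-language model's relevance score.
    noisy_score = lambda idiom, img: random.random()
    images = ["img_a.png", "img_b.png", "img_c.png"]
    print(aggregate_rankings(images, noisy_score, "spill the beans"))
```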

Country of Origin
🇬🇧 United Kingdom

Repos / Data Links

Page Count
13 pages

Category
Computer Science: Computation and Language