Score: 0

Stands to Reason: Investigating the Effect of Reasoning on Idiomaticity Detection

Published: August 18, 2025 | arXiv ID: 2508.13365v1

By: Dylan Phelps , Rodrigo Wilkens , Edward Gow-Smith and more

Potential Business Impact:

Helps computers understand tricky sayings better.

The recent trend towards utilisation of reasoning models has improved the performance of Large Language Models (LLMs) across many tasks which involve logical steps. One linguistic task that could benefit from this framing is idiomaticity detection, as a potentially idiomatic expression must first be understood before it can be disambiguated and serves as a basis for reasoning. In this paper, we explore how reasoning capabilities in LLMs affect idiomaticity detection performance and examine the effect of model size. We evaluate, as open source representative models, the suite of DeepSeek-R1 distillation models ranging from 1.5B to 70B parameters across four idiomaticity detection datasets. We find the effect of reasoning to be smaller and more varied than expected. For smaller models, producing chain-of-thought (CoT) reasoning increases performance from Math-tuned intermediate models, but not to the levels of the base models, whereas larger models (14B, 32B, and 70B) show modest improvements. Our in-depth analyses reveal that larger models demonstrate good understanding of idiomaticity, successfully producing accurate definitions of expressions, while smaller models often fail to output the actual meaning. For this reason, we also experiment with providing definitions in the prompts of smaller models, which we show can improve performance in some cases.

When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

Machine Learning (CS)

Helps computers understand questions better.

30 Apr 2025 1

90%

When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs

Computation and Language

Makes AI follow instructions better by fixing reasoning.

16 May 2025 0

89%

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Artificial Intelligence

Makes AI think faster without losing accuracy.

4 Aug 2025 1

View PDF Login to Bookmark

Country of Origin

🇬🇧 United Kingdom

Page Count

10 pages

Stands to Reason: Investigating the Effect of Reasoning on Idiomaticity Detection

Helps computers understand tricky sayings better.

Technical Abstract

When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator

When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models