
Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

Published: January 15, 2026 | arXiv ID: 2601.10679v1

By: Zirui Ren, Ziming Liu

The hierarchical reasoning model (HRM) achieves extraordinary performance on various reasoning tasks, significantly outperforming large language model-based reasoners. To understand the strengths and potential failure modes of HRM, we conduct a mechanistic study of its reasoning patterns and find three surprising facts: (a) Failure on extremely simple puzzles: HRM can fail on a puzzle with only one unknown cell. We attribute this failure to a violation of the fixed-point property, a fundamental assumption of HRM. (b) "Grokking" dynamics in reasoning steps: the answer does not improve uniformly; instead, a single critical reasoning step suddenly makes the answer correct. (c) Existence of multiple fixed points: HRM "guesses" the first fixed point, which could be incorrect, and gets trapped there for a while or forever. All three facts suggest that HRM is "guessing" rather than "reasoning". Leveraging this "guessing" picture, we propose three strategies to scale HRM's guesses: data augmentation (scaling the quality of guesses), input perturbation (scaling the number of guesses by leveraging inference randomness), and model bootstrapping (scaling the number of guesses by leveraging training randomness). On the practical side, by combining all three methods, we develop Augmented HRM, which boosts accuracy on Sudoku-Extreme from 54.5% to 96.9%. On the scientific side, our analysis provides new insights into how reasoning models "reason".
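The "guessing" picture in (c) and the input-perturbation strategy can be illustrated with a toy fixed-point iteration. The sketch below is not HRM and is not taken from the paper: the update map `tanh(3z)`, the function names, and the choice of the positive fixed point as the "correct" answer are all assumptions made purely for illustration. It only shows how an iterated update can settle into whichever fixed point is nearest its starting state, and how randomly perturbing that start produces several independent guesses.

```python
import math
import random


def step(z: float) -> float:
    """One toy 'reasoning step' (hypothetical stand-in, not HRM's update).

    The map z -> tanh(3z) has three fixed points: an unstable one at 0
    and two stable ones near +0.995 and -0.995.
    """
    return math.tanh(3.0 * z)


def run(z0: float, n_steps: int = 50) -> float:
    """Iterate the toy step from the initial state z0 and return the final state."""
    z = z0
    for _ in range(n_steps):
        z = step(z)
    return z


def is_correct(z: float) -> bool:
    """Assumption: the positive fixed point plays the role of the correct answer."""
    return z > 0.9


def perturbed_guesses(z0: float, n_guesses: int, noise: float, seed: int = 0) -> list:
    """Run the iteration from n_guesses randomly perturbed copies of z0."""
    rng = random.Random(seed)
    return [run(z0 + rng.gauss(0.0, noise)) for _ in range(n_guesses)]


if __name__ == "__main__":
    # A single unperturbed run from a slightly negative start is trapped
    # at the "wrong" (negative) fixed point.
    print("single guess correct:", is_correct(run(-0.1)))

    # Perturbing the start yields several guesses; count how many land
    # on the "correct" fixed point.
    guesses = perturbed_guesses(-0.1, n_guesses=16, noise=0.5)
    print("correct guesses:", sum(is_correct(g) for g in guesses), "of", len(guesses))
```

In this toy setting a single run started at `-0.1` always ends at the negative fixed point, whereas a batch of perturbed runs typically contains some that end at the positive one. How Augmented HRM actually generates and combines its guesses is described in the paper, not here.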

HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning

Artificial Intelligence

Helps robots learn to solve problems faster.

26 Oct 2025

Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models

Computation and Language

Teaches computers to think better, step-by-step.

16 Mar 2025

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Artificial Intelligence

Teaches computers to think smarter, like humans.

3 Sep 2025
