Universal Reasoning Model
By: Zitian Gao, Lynx Chen, Yihao Xiao, et al.
Universal transformers (UTs) have been widely used for complex reasoning tasks such as ARC-AGI and Sudoku, yet the specific sources of their performance gains remain underexplored. In this work, we systematically analyze UT variants and show that improvements on ARC-AGI arise primarily from the recurrent inductive bias and the strong nonlinear components of the Transformer, rather than from elaborate architectural designs. Motivated by this finding, we propose the Universal Reasoning Model (URM), which enhances the UT with short convolution and truncated backpropagation. Our approach substantially improves reasoning performance, achieving state-of-the-art 53.8% pass@1 on ARC-AGI 1 and 16.0% pass@1 on ARC-AGI 2. Our code is available at https://github.com/zitian-gao/URM.
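To make the two ingredients named in the abstract concrete, here is a minimal PyTorch sketch of a weight-shared (Universal Transformer) block augmented with a short depthwise convolution, applied recurrently with truncated backpropagation. All names and hyperparameters (URMBlock, run_recurrent, n_loops, tbptt_every, conv_kernel) are illustrative assumptions, not the authors' released implementation; see the linked repository for the actual code.

```python
# Illustrative sketch only; class and argument names are hypothetical.
import torch
import torch.nn as nn

class URMBlock(nn.Module):
    """One shared pre-norm Transformer block, reused across recurrence steps,
    with a short depthwise causal convolution mixing nearby tokens before attention."""
    def __init__(self, d_model=256, n_heads=4, conv_kernel=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(d_model)
        self.conv = nn.Conv1d(d_model, d_model, conv_kernel,
                              padding=conv_kernel - 1, groups=d_model)  # short conv
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))

    def forward(self, x):
        h = self.norm1(x)
        # causal short convolution over the sequence dimension
        h = self.conv(h.transpose(1, 2))[..., : x.size(1)].transpose(1, 2)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        x = x + self.mlp(self.norm2(x))
        return x

def run_recurrent(block, x, n_loops=16, tbptt_every=4):
    """Apply the same block n_loops times (Universal Transformer recurrence),
    detaching the state periodically so gradients flow only through the most
    recent tbptt_every steps (truncated backpropagation)."""
    for step in range(n_loops):
        if step % tbptt_every == 0:
            x = x.detach()  # truncate the backward pass through earlier loops
        x = block(x)
    return x

# Usage: x has shape (batch, seq_len, d_model)
# block = URMBlock(); out = run_recurrent(block, torch.randn(2, 64, 256))
```

The detach-based truncation is a generic way to limit gradient depth through the recurrence; the paper's exact truncation schedule may differ.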