Score: 0

Johnny: Structuring Representation Space to Enhance Machine Abstract Reasoning Ability

Published: May 13, 2025 | arXiv ID: 2506.01970v1

By: Ruizhuo Song, Beiming Yuan

Potential Business Impact:

AI learns to solve tricky picture puzzles better.

Business Areas:
Intelligent Systems Artificial Intelligence, Data and Analytics, Science and Engineering

This paper thoroughly investigates the challenges of enhancing AI's abstract reasoning capabilities, with a particular focus on Raven's Progressive Matrices (RPM) tasks involving complex human-like concepts. Firstly, it dissects the empirical reality that traditional end-to-end RPM-solving models heavily rely on option pool configurations, highlighting that this dependency constrains the model's reasoning capabilities. To address this limitation, the paper proposes the Johnny architecture - a novel representation space-based framework for RPM-solving. Through the synergistic operation of its Representation Extraction Module and Reasoning Module, Johnny significantly enhances reasoning performance by supplementing primitive negative option configurations with a learned representation space. Furthermore, to strengthen the model's capacity for capturing positional relationships among local features, the paper introduces the Spin-Transformer network architecture, accompanied by a lightweight Straw Spin-Transformer variant that reduces computational overhead through parameter sharing and attention mechanism optimization. Experimental evaluations demonstrate that both Johnny and Spin-Transformer achieve superior performance on RPM tasks, offering innovative methodologies for advancing AI's abstract reasoning capabilities.

Country of Origin
🇨🇳 China

Page Count
15 pages

Category
Computer Science:
Machine Learning (CS)