Score: 2

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Published: July 7, 2025 | arXiv ID: 2507.06804v1

By: Zhenwen Liang , Linfeng Song , Yang Li and more

BigTech Affiliations: Tencent

Potential Business Impact:

Helps computers solve hard math problems.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Automated Theorem Proving (ATP) in formal languages is a foundational challenge for AI. While Large Language Models (LLMs) have driven remarkable progress, a significant gap remains between their powerful informal reasoning capabilities and their weak formal proving performance. Recent studies show that the informal accuracy exceeds 80% while formal success remains below 8% on benchmarks like PutnamBench. We argue this gap persists because current state-of-the-art provers, by tightly coupling reasoning and proving, are trained with paradigms that inadvertently punish deep reasoning in favor of shallow, tactic-based strategies. To bridge this fundamental gap, we propose a novel framework that decouples high-level reasoning from low-level proof generation. Our approach utilizes two distinct, specialized models: a powerful, general-purpose Reasoner to generate diverse, strategic subgoal lemmas, and an efficient Prover to rigorously verify them. This modular design liberates the model's full reasoning potential and bypasses the pitfalls of end-to-end training. We evaluate our method on a challenging set of post-2000 IMO problems, a problem set on which no prior open-source prover has reported success. Our decoupled framework successfully solves 5 of these problems, demonstrating a significant step towards automated reasoning on exceptionally difficult mathematical challenges. To foster future research, we release our full dataset of generated and verified lemmas for a wide range of IMO problems, available at https://tencent-imo.github.io/ .

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Artificial Intelligence

Makes computers prove math ideas much faster.

8 Apr 2025 3

89%

From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Artificial Intelligence

Teaches computers to check math and code perfectly.

27 Jan 2025 1

89%

Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs

Logic in Computer Science

Creates math problems for AI to solve.

21 Aug 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

37 pages

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Helps computers solve hard math problems.

Technical Abstract

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs