Leanabell-Prover-V2: Verifier-integrated Reasoning for Formal Theorem Proving via Reinforcement Learning
By: Xingguang Ji, Yahui Liu, Qi Wang, and more
Potential Business Impact:
Helps computers prove math ideas correctly.
We introduce Leanabell-Prover-V2, a 7B large language model (LLM) that can produce formal theorem proofs in Lean 4 with verifier-integrated long chain-of-thought (CoT) reasoning. Following our previous work, Leanabell-Prover-V1, we continue to post-train existing strong prover models for further performance improvement. In this V2 version, we mainly upgrade the Reinforcement Learning (RL) stage with feedback provided by the Lean 4 verifier. Crucially, verifier feedback, such as indicating success or detailing specific errors, allows the LLM to become "self-aware" of the correctness of its own reasoning process and learn to reflexively correct errors. Leanabell-Prover-V2 directly optimizes LLM reasoning trajectories with multi-turn verifier interactions, together with feedback token masking for stable RL training and a simple reward strategy. Experiments show that Leanabell-Prover-V2 improves performance by 3.2% (pass@128) with Kimina-Prover-Preview-Distill-7B and 2.0% (pass@128) with DeepSeek-Prover-V2-7B on the MiniF2F test set. The source codes, curated data, and models are available at: https://github.com/Leanabell-LM/Leanabell-Prover-V2.
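For illustration, the following minimal Python sketch shows the kind of multi-turn, verifier-integrated rollout the abstract describes: the model drafts a proof, the Lean 4 verifier returns success or an error message, the error is fed back into the context for the next attempt, verifier-feedback tokens are masked out of the RL loss, and a binary reward is assigned when a proof finally checks. All names here (generate, run_lean_verifier, tokenize) are hypothetical placeholders, not the project's actual API.

```python
# A minimal sketch (not the authors' implementation) of one multi-turn,
# verifier-integrated rollout with feedback-token masking and a binary reward.
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Segment:
    token_ids: List[int]   # token ids for this segment of the trajectory
    from_model: bool       # True if model-generated, False if verifier feedback

def tokenize(text: str) -> List[int]:
    return list(text.encode("utf-8"))           # placeholder byte-level "tokenizer"

def generate(context: str) -> Tuple[str, List[int]]:
    attempt = "sorry"                           # placeholder for an LLM proof sample
    return attempt, tokenize(attempt)

def run_lean_verifier(proof: str) -> Tuple[bool, str]:
    return False, "error: unsolved goals"       # placeholder for Lean 4 verifier output

def rollout(theorem: str, max_turns: int = 3):
    """Roll out up to `max_turns` proof attempts, feeding verifier errors back."""
    context, segments, reward = theorem, [], 0.0
    for _ in range(max_turns):
        proof, ids = generate(context)
        segments.append(Segment(ids, from_model=True))
        ok, message = run_lean_verifier(proof)
        if ok:
            reward = 1.0                        # simple reward: 1 iff the proof verifies
            break
        feedback = f"\n<verifier>\n{message}\n</verifier>\n"
        segments.append(Segment(tokenize(feedback), from_model=False))
        context += proof + feedback             # model sees the error and can self-correct
    # Feedback tokens are masked out of the RL objective; only model tokens are trained on.
    loss_mask = [int(seg.from_model) for seg in segments for _ in seg.token_ids]
    return segments, loss_mask, reward
```

The key design choice in this sketch is the loss mask: verifier messages appear in the context so the model can condition on them, but they contribute nothing to the policy gradient, which the paper credits with stabilizing RL training.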
Similar Papers
Leanabell-Prover: Posttraining Scaling in Formal Reasoning
Artificial Intelligence
Makes computers prove math ideas much faster.
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
Computation and Language
Helps computers prove math problems like a genius.
Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
Machine Learning (CS)
Helps computers prove math problems faster.