Score: 1

DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models

Published: April 22, 2025 | arXiv ID: 2504.15716v1

By: Jie Zhu , Qian Chen , Huaixia Dou and more

Potential Business Impact:

Helps computers understand money rules better.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Effective reasoning remains a core challenge for large language models (LLMs) in the financial domain, where tasks often require domain-specific knowledge, precise numerical calculations, and strict adherence to compliance rules. We propose DianJin-R1, a reasoning-enhanced framework designed to address these challenges through reasoning-augmented supervision and reinforcement learning. Central to our approach is DianJin-R1-Data, a high-quality dataset constructed from CFLUE, FinQA, and a proprietary compliance corpus (Chinese Compliance Check, CCC), combining diverse financial reasoning scenarios with verified annotations. Our models, DianJin-R1-7B and DianJin-R1-32B, are fine-tuned from Qwen2.5-7B-Instruct and Qwen2.5-32B-Instruct using a structured format that generates both reasoning steps and final answers. To further refine reasoning quality, we apply Group Relative Policy Optimization (GRPO), a reinforcement learning method that incorporates dual reward signals: one encouraging structured outputs and another rewarding answer correctness. We evaluate our models on five benchmarks: three financial datasets (CFLUE, FinQA, and CCC) and two general reasoning benchmarks (MATH-500 and GPQA-Diamond). Experimental results show that DianJin-R1 models consistently outperform their non-reasoning counterparts, especially on complex financial tasks. Moreover, on the real-world CCC dataset, our single-call reasoning models match or even surpass the performance of multi-agent systems that require significantly more computational cost. These findings demonstrate the effectiveness of DianJin-R1 in enhancing financial reasoning through structured supervision and reward-aligned learning, offering a scalable and practical solution for real-world applications.

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Computation and Language

Helps computers understand and solve money problems.

20 Mar 2025 2

90%

Fino1: On the Transferability of Reasoning-Enhanced LLMs and Reinforcement Learning to Finance

Computation and Language

Helps computers make smart money decisions.

12 Feb 2025 3

90%

DianJin-OCR-R1: Enhancing OCR Capabilities via a Reasoning-and-Tool Interleaved Vision-Language Model

CV and Pattern Recognition

Makes computers read text better, even if it's messy.

18 Aug 2025 1

View PDF Login to Bookmark

Repos / Data Links

github.com github.com

Page Count

15 pages

DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models

Helps computers understand money rules better.

Technical Abstract

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Fino1: On the Transferability of Reasoning-Enhanced LLMs and Reinforcement Learning to Finance

DianJin-OCR-R1: Enhancing OCR Capabilities via a Reasoning-and-Tool Interleaved Vision-Language Model