Score: 1

Structured Document Translation via Format Reinforcement Learning

Published: December 4, 2025 | arXiv ID: 2512.05100v1

By: Haiyue Song , Johannes Eschbach-Dymanus , Hour Kaing and more

BigTech Affiliations: SAP

Potential Business Impact:

Teaches computers to translate web pages perfectly.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Recent works on structured text translation remain limited to the sentence level, as they struggle to effectively handle the complex document-level XML or HTML structures. To address this, we propose \textbf{Format Reinforcement Learning (FormatRL)}, which employs Group Relative Policy Optimization on top of a supervised fine-tuning model to directly optimize novel structure-aware rewards: 1) TreeSim, which measures structural similarity between predicted and reference XML trees and 2) Node-chrF, which measures translation quality at the level of XML nodes. Additionally, we apply StrucAUC, a fine-grained metric distinguishing between minor errors and major structural failures. Experiments on the SAP software-documentation benchmark demonstrate improvements across six metrics and an analysis further shows how different reward functions contribute to improvements in both structural and translation quality.

Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning

Computation and Language

Helps computers reason better with organized facts.

16 Oct 2025 1

87%

Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation

Artificial Intelligence

Makes computers turn charts into code better.

19 Aug 2025 3

87%

Beyond Query-Level Comparison: Fine-Grained Reinforcement Learning for Text-to-SQL with Automated Interpretable Critiques

Computation and Language

Teaches computers to understand database questions better.

27 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇩🇪 Germany

Page Count

21 pages

Structured Document Translation via Format Reinforcement Learning

Teaches computers to translate web pages perfectly.

Technical Abstract

Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning

Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation

Beyond Query-Level Comparison: Fine-Grained Reinforcement Learning for Text-to-SQL with Automated Interpretable Critiques