Policy-based Sentence Simplification: Replacing Parallel Corpora with LLM-as-a-Judge
By: Xuanxin Wu, Yuki Arase, Masaaki Nagata
Potential Business Impact:
Makes hard sentences easy to read automatically.
Sentence simplification aims to modify a sentence to make it easier to read and understand while preserving the meaning. Different applications require distinct simplification policies, such as replacing only complex words at the lexical level or rewriting the entire sentence while trading off details for simplicity. However, achieving such policy-driven control remains an open challenge. In this work, we introduce a simple yet powerful approach that leverages Large Language Model-as-a-Judge (LLM-as-a-Judge) to automatically construct policy-aligned training data, completely removing the need for costly human annotation or parallel corpora. Our method enables building simplification systems that adapt to diverse simplification policies. Remarkably, even small-scale open-source LLMs such as Phi-3-mini-3.8B surpass GPT-4o on lexical-oriented simplification, while achieving comparable performance on overall rewriting, as verified by both automatic metrics and human evaluations. The consistent improvements across model families and sizes demonstrate the robustness of our approach.
Similar Papers
Interpreting LLM-as-a-Judge Policies via Verifiable Global Explanations
Computation and Language
Finds hidden rules in AI's text judgments.
SimplifyMyText: An LLM-Based System for Inclusive Plain Language Text Simplification
Computation and Language
Makes hard words easy for everyone to read.
From Code to Courtroom: LLMs as the New Software Judges
Software Engineering
Lets computers check other computer code quality.