Score: 0

Enhancing Long Document Long Form Summarisation with Self-Planning

Published: December 19, 2025 | arXiv ID: 2512.17179v1

By: Xiaotang Du , Rohit Saxena , Laura Perez-Beltrachini and more

Potential Business Impact:

Makes summaries of long texts more accurate.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We introduce a novel approach for long context summarisation, highlight-guided generation, that leverages sentence-level information as a content plan to improve the traceability and faithfulness of generated summaries. Our framework applies self-planning methods to identify important content and then generates a summary conditioned on the plan. We explore both an end-to-end and two-stage variants of the approach, finding that the two-stage pipeline performs better on long and information-dense documents. Experiments on long-form summarisation datasets demonstrate that our method consistently improves factual consistency while preserving relevance and overall quality. On GovReport, our best approach has improved ROUGE-L by 4.1 points and achieves about 35% gains in SummaC scores. Qualitative analysis shows that highlight-guided summarisation helps preserve important details, leading to more accurate and insightful summaries across domains.

Country of Origin
🇬🇧 United Kingdom

Page Count
16 pages

Category
Computer Science:
Computation and Language