Enhancing Long Document Long Form Summarisation with Self-Planning
By: Xiaotang Du , Rohit Saxena , Laura Perez-Beltrachini and more
Potential Business Impact:
Makes summaries of long texts more accurate.
We introduce a novel approach for long context summarisation, highlight-guided generation, that leverages sentence-level information as a content plan to improve the traceability and faithfulness of generated summaries. Our framework applies self-planning methods to identify important content and then generates a summary conditioned on the plan. We explore both an end-to-end and two-stage variants of the approach, finding that the two-stage pipeline performs better on long and information-dense documents. Experiments on long-form summarisation datasets demonstrate that our method consistently improves factual consistency while preserving relevance and overall quality. On GovReport, our best approach has improved ROUGE-L by 4.1 points and achieves about 35% gains in SummaC scores. Qualitative analysis shows that highlight-guided summarisation helps preserve important details, leading to more accurate and insightful summaries across domains.
Similar Papers
A LongFormer-Based Framework for Accurate and Efficient Medical Text Summarization
Computation and Language
Makes doctor notes shorter and easier to read.
Learning from Self Critique and Refinement for Faithful LLM Summarization
Computation and Language
Teaches AI to write summaries without making things up.
Explanatory Summarization with Discourse-Driven Planning
Computation and Language
Helps computers explain science simply and accurately.