Score: 3

Topic-Guided Reinforcement Learning with LLMs for Enhancing Multi-Document Summarization

Published: September 11, 2025 | arXiv ID: 2509.09852v1

By: Chuyuan Li , Austin Xu , Shafiq Joty and more

BigTech Affiliations: Salesforce Research

Potential Business Impact:

Helps computers write better summaries from many stories.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

A key challenge in Multi-Document Summarization (MDS) is effectively integrating information from multiple sources while maintaining coherence and topical relevance. While Large Language Models have shown impressive results in single-document summarization, their performance on MDS still leaves room for improvement. In this paper, we propose a topic-guided reinforcement learning approach to improve content selection in MDS. We first show that explicitly prompting models with topic labels enhances the informativeness of the generated summaries. Building on this insight, we propose a novel topic reward within the Group Relative Policy Optimization (GRPO) framework to measure topic alignment between the generated summary and source documents. Experimental results on the Multi-News and Multi-XScience datasets demonstrate that our method consistently outperforms strong baselines, highlighting the effectiveness of leveraging topical cues in MDS.

Country of Origin
πŸ‡ΊπŸ‡Έ πŸ‡¨πŸ‡¦ Canada, United States

Repos / Data Links

Page Count
18 pages

Category
Computer Science:
Computation and Language