Score: 3

Leveraging Hierarchical Organization for Medical Multi-document Summarization

Published: October 27, 2025 | arXiv ID: 2510.23104v2

By: Yi-Li Hsu, Katelyn X. Mei, Lucy Lu Wang

BigTech Affiliations: University of Washington

Potential Business Impact:

Makes doctor summaries clearer and preferred by people.

Business Areas:
Semantic Search Internet Services

Medical multi-document summarization (MDS) is a complex task that requires effectively managing cross-document relationships. This paper investigates whether incorporating hierarchical structures in the inputs of MDS can improve a model's ability to organize and contextualize information across documents compared to traditional flat summarization methods. We investigate two ways of incorporating hierarchical organization across three large language models (LLMs), and conduct comprehensive evaluations of the resulting summaries using automated metrics, model-based metrics, and domain expert evaluation of preference, understandability, clarity, complexity, relevance, coverage, factuality, and coherence. Our results show that human experts prefer model-generated summaries over human-written summaries. Hierarchical approaches generally preserve factuality, coverage, and coherence of information, while also increasing human preference for summaries. Additionally, we examine whether simulated judgments from GPT-4 align with human judgments, finding higher agreement along more objective evaluation facets. Our findings demonstrate that hierarchical structures can improve the clarity of medical summaries generated by models while maintaining content coverage, providing a practical way to improve human preference for generated summaries.

Country of Origin
🇺🇸 🇹🇼 United States, Taiwan, Province of China

Repos / Data Links

Page Count
19 pages

Category
Computer Science:
Computation and Language