Mixture-of-RAG: Integrating Text and Tables with Large Language Models

Published: April 13, 2025 | arXiv ID: 2504.09554v2

By: Chi Zhang, Qiyang Chen, Mengqi Zhang

Potential Business Impact:

Helps computers understand text and tables together.

Business Areas:

Text Analytics, Data and Analytics, Software

Large language models (LLMs) achieve optimal utility when their responses are grounded in external knowledge sources. However, real-world documents, such as annual reports, scientific papers, and clinical guidelines, frequently combine extensive narrative content with complex, hierarchically structured tables. While existing retrieval-augmented generation (RAG) systems effectively integrate LLMs' generative capabilities with external retrieval-based information, their performance significantly deteriorates when processing such heterogeneous text-table hierarchies. To address this limitation, we formalize the task of Heterogeneous Document RAG, which requires joint retrieval and reasoning across textual and hierarchical tabular data. We propose MixRAG, a novel three-stage framework: (i) hierarchy row-and-column-level (H-RCL) representation that preserves hierarchical structure and heterogeneous relationships, (ii) an ensemble retriever with LLM-based reranking for evidence alignment, and (iii) multi-step reasoning decomposition via a RECAP prompt strategy. To bridge the gap in available data for this domain, we release a large-scale dataset, DocRAGLib, a 2k-document corpus paired with automatically aligned text-table summaries and gold document annotations. The comprehensive experimental results demonstrate that MixRAG boosts top-1 retrieval by 46% over strong text-only, table-only, and naive-mixture baselines, establishing new state-of-the-art performance for mixed-modality document grounding.
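As a rough illustration of stage (ii), the sketch below fuses ranked lists from two retrievers and then hands the fused candidates to an optional reranker. The abstract does not say how MixRAG combines its retrievers, so reciprocal rank fusion is used here purely as a stand-in technique. The names `reciprocal_rank_fusion`, `retrieve`, and `rerank`, the document IDs, and the constant `k=60` are all hypothetical and not taken from the paper.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Assumption: reciprocal rank fusion stands in for the ensemble step;
    the paper's actual fusion method is not described in the abstract.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def retrieve(rankings, rerank=None, top_n=3):
    """Fuse the rankings, keep the top_n candidates, then optionally rerank.

    `rerank` is a hypothetical hook where an LLM-based reranker could go.
    """
    fused = reciprocal_rank_fusion(rankings)[:top_n]
    return rerank(fused) if rerank else fused


# Toy ranked lists, e.g. from a sparse and a dense retriever (made up).
sparse = ["doc_a", "doc_b", "doc_c"]
dense = ["doc_b", "doc_d", "doc_a"]
fused = retrieve([sparse, dense])
print(fused)
```

Documents that rank well in both lists rise to the top of the fused ranking. A real reranker would replace the `rerank` hook with a model call that scores each candidate against the query.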

TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning

Computation and Language

Helps computers answer questions from mixed text and tables.

12 Jun 2025 3

93%

MIXRAG: Mixture-of-Experts Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

Information Retrieval

Helps AI answer questions better by using smart search.

24 Sep 2025 1

92%

Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding

Computation and Language

Helps computers understand all parts of documents.

17 Oct 2025 0

Country of Origin

🇨🇳 🇺🇸 China, United States

Page Count

25 pages
