Score: 0

Collaboration among Multiple Large Language Models for Medical Question Answering

Published: May 22, 2025 | arXiv ID: 2505.16648v1

By: Kexin Shang, Chia-Hsuan Chang, Christopher C. Yang

Potential Business Impact:

Multiple AI doctors solve harder medical questions.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Empowered by vast internal knowledge reservoir, the new generation of large language models (LLMs) demonstrate untapped potential to tackle medical tasks. However, there is insufficient effort made towards summoning up a synergic effect from multiple LLMs' expertise and background. In this study, we propose a multi-LLM collaboration framework tailored on a medical multiple-choice questions dataset. Through post-hoc analysis on 3 pre-trained LLM participants, our framework is proved to boost all LLMs reasoning ability as well as alleviate their divergence among questions. We also measure an LLM's confidence when it confronts with adversary opinions from other LLMs and observe a concurrence between LLM's confidence and prediction accuracy.

Country of Origin
🇺🇸 United States

Page Count
9 pages

Category
Computer Science:
Computation and Language