DrugRAG: Enhancing Pharmacy LLM Performance Through A Novel Retrieval-Augmented Generation Pipeline

Published: December 16, 2025 | arXiv ID: 2512.14896v1

By: Houman Kazemzadeh, Kiarash Mokhtari Dizaji, Seyed Reza Tavakoli, and more

Potential Business Impact:

Helps large language models answer pharmacy licensure-style questions more accurately.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Objectives: To evaluate large language model (LLM) performance on pharmacy licensure-style question-answering (QA) tasks and to develop an external knowledge-integration method that improves their accuracy.

Methods: We benchmarked eleven existing LLMs with varying parameter sizes (8 billion to 70+ billion) on a 141-question pharmacy dataset, measuring baseline accuracy for each model without modification. We then developed a three-step retrieval-augmented generation (RAG) pipeline, DrugRAG, that retrieves structured drug knowledge from validated sources and augments model prompts with evidence-based context. The pipeline operates externally to the models, requiring no changes to model architecture or parameters.

Results: Baseline accuracy ranged from 46% to 92%, with GPT-5 (92%) and o3 (89%) achieving the highest scores; models at the 8-billion-parameter scale scored below 50%. DrugRAG improved accuracy across all tested models, with gains of 7 to 21 percentage points on the 141-item benchmark (e.g., Gemma 3 27B: 61% to 71%; Llama 3.1 8B: 46% to 67%).

Conclusion: External integration of structured drug knowledge through DrugRAG measurably improves LLM accuracy on pharmacy tasks without modifying the underlying models, providing a practical pipeline for enhancing pharmacy-focused AI applications with evidence-based information.
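
The abstract describes DrugRAG as a three-step pipeline that sits outside the model: identify the drugs a question concerns, retrieve structured facts about them from a validated source, and augment the prompt with that context before querying the LLM. The Python sketch below illustrates that flow under stated assumptions; the function names, the toy knowledge table, and the `call_llm` hook are illustrative placeholders, not the authors' published interface.

```python
# Minimal sketch of a three-step retrieval-augmented pipeline in the spirit of
# DrugRAG. All names (DRUG_KNOWLEDGE, extract_drug_mentions, call_llm) are
# illustrative assumptions; the paper does not publish this interface.

from typing import Dict, List

# Stand-in for a validated structured drug knowledge source
# (e.g., monograph-style facts keyed by drug name). Entries are examples only.
DRUG_KNOWLEDGE: Dict[str, List[str]] = {
    "warfarin": [
        "Anticoagulant; monitor INR regularly.",
        "Interacts with many drugs, including NSAIDs and certain antibiotics.",
    ],
    "metformin": [
        "First-line oral agent for type 2 diabetes.",
        "Contraindicated in severe renal impairment.",
    ],
}


def extract_drug_mentions(question: str) -> List[str]:
    """Step 1: naive entity extraction -- find known drug names in the question."""
    lowered = question.lower()
    return [drug for drug in DRUG_KNOWLEDGE if drug in lowered]


def retrieve_context(drugs: List[str]) -> str:
    """Step 2: pull structured facts for each mentioned drug."""
    lines = []
    for drug in drugs:
        for fact in DRUG_KNOWLEDGE.get(drug, []):
            lines.append(f"- {drug}: {fact}")
    return "\n".join(lines)


def build_augmented_prompt(question: str, context: str) -> str:
    """Step 3: prepend evidence-based context to the original question."""
    if not context:
        return question
    return (
        "Use the following drug knowledge when answering.\n"
        f"{context}\n\n"
        f"Question: {question}"
    )


def answer_with_drug_rag(question: str, call_llm) -> str:
    """Run the full pipeline; `call_llm` is any text-in/text-out model client."""
    drugs = extract_drug_mentions(question)
    context = retrieve_context(drugs)
    prompt = build_augmented_prompt(question, context)
    return call_llm(prompt)


if __name__ == "__main__":
    # Stub LLM client so the sketch runs without any external service.
    echo_llm = lambda prompt: f"[model sees {len(prompt)} chars of prompt]"
    print(answer_with_drug_rag(
        "Which lab value should be monitored for a patient on warfarin?",
        echo_llm,
    ))
```

Because the augmentation happens entirely in the prompt, the same wrapper can front any of the eleven benchmarked models without touching their weights, which matches the paper's claim that the pipeline requires no architectural changes.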

Page Count
11 pages

Category
Computer Science:
Computation and Language