Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning
By: Zhenchao Tang, Fang Wang, Haohuai He, and more
Potential Business Impact:
Teaches computers to understand complex science.
Effective post-training is essential to align Large Language Models (LLMs) with specialized biomedical knowledge to accelerate life science research. However, current approaches face significant limitations. First, biomedical reasoning involves intricate mechanisms often represented by sparse textual data. Standard Supervised Fine-Tuning (SFT) tends to overfit to surface-level instruction patterns without effectively internalizing this fragmented scientific knowledge. Second, Reinforcement Learning (RL) is impractical for this domain, as defining meaningful rewards often necessitates prohibitive experimental validation (e.g., wet-lab verification of drug responses), rendering real-time feedback infeasible. We propose Balanced Fine-Tuning (BFT), an efficient post-training method designed to learn complex reasoning from sparse data without external reward signals. BFT operates through a two-layer weighting mechanism: 1. At the token level, it scales the loss via prediction probabilities to stabilize gradients and prevent overfitting; 2. At the sample level, it uses "minimum group confidence" to adaptively enhance the learning of hard samples. Experiments demonstrate that BFT significantly outperforms SFT. In medical tasks, it enables LLMs to acquire knowledge that SFT misses. In biological tasks, BFT-based LLMs surpass GeneAgent (an accurate agent for biological analysis) in biological process reasoning. Moreover, the text embeddings generated by BFT can be directly applied to downstream tasks, such as gene interaction and single-cell perturbation response prediction. These results indicate that BFT facilitates broad applications of LLMs in biomedical research.
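To make the two-layer weighting concrete, here is a minimal PyTorch sketch of a loss of that shape. The abstract does not give the exact weighting functions, so the token-level scaling (weight = 1 - p) and the sample-level "minimum group confidence" weight (1 - minimum token probability in the sample) are illustrative assumptions, not the authors' formulation; the function name bft_loss is hypothetical.

```python
# Sketch of a BFT-style loss: token-level scaling by prediction probability
# plus sample-level upweighting of low-confidence (hard) samples.
import torch
import torch.nn.functional as F

def bft_loss(logits, labels, ignore_index=-100):
    """logits: (batch, seq_len, vocab); labels: (batch, seq_len)."""
    log_probs = F.log_softmax(logits, dim=-1)

    mask = (labels != ignore_index).float()            # valid target tokens
    safe_labels = labels.clamp(min=0)
    token_logp = log_probs.gather(-1, safe_labels.unsqueeze(-1)).squeeze(-1)
    token_p = token_logp.exp()                         # prediction probability per token

    # Token level: scale the per-token loss by prediction probability so that
    # already-confident tokens contribute smaller gradients (assumed form: 1 - p).
    token_weight = (1.0 - token_p).detach()
    token_loss = -(token_weight * token_logp) * mask

    # Sample level: take the minimum token confidence within each sample as an
    # (assumed) stand-in for "minimum group confidence" and upweight hard samples.
    min_conf = torch.where(mask.bool(), token_p, torch.ones_like(token_p)).min(dim=-1).values
    sample_weight = (1.0 - min_conf).detach()          # (batch,)

    per_sample = token_loss.sum(dim=-1) / mask.sum(dim=-1).clamp(min=1.0)
    return (sample_weight * per_sample).mean()

# Usage: loss = bft_loss(model(input_ids).logits, labels); loss.backward()
```

Both weights are detached, so they reshape the loss landscape without adding extra gradient paths, which matches the stated goal of stabilizing gradients while emphasizing hard samples.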
Similar Papers
Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
Computation and Language
Makes AI better at following instructions.
AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance
Machine Learning (CS)
Teaches computers to learn better and faster.
Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning
Machine Learning (CS)
Makes AI better at thinking, even small ones.