Synergistic Weak-Strong Collaboration by Aligning Preferences
By: Yizhu Jiao, Xuchao Zhang, Zhaoyang Wang and more
Potential Business Impact:
Helps smart computers learn new, specific jobs.
Current Large Language Models (LLMs) excel in general reasoning yet struggle with specialized tasks requiring proprietary or domain-specific knowledge. Fine-tuning large models for every niche application is often infeasible due to black-box constraints and high computational overhead. To address this, we propose a collaborative framework that pairs a specialized weak model with a general strong model. The weak model, tailored to specific domains, produces initial drafts and background information, while the strong model leverages its advanced reasoning to refine these drafts, extending LLMs' capabilities to critical yet specialized tasks. To optimize this collaboration, we introduce collaborative feedback to fine-tune the weak model. This feedback quantifies the influence of the weak model's contributions on the collaboration and establishes preference pairs to guide preference tuning of the weak model. We validate our framework through experiments on three domains. We find that the collaboration significantly outperforms each model alone by leveraging complementary strengths. Moreover, aligning the weak model with the collaborative preference further enhances overall performance.
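The collaborative feedback step can be illustrated with a minimal sketch. The abstract does not say how a draft's influence is measured, so this example assumes one plausible reading: each weak-model draft is judged by the quality of the strong model's refined output, and the best and worst drafts form a preference pair. All names here (`PreferencePair`, `build_preference_pair`, `toy_refine`, `toy_score`) are hypothetical and not taken from the paper.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class PreferencePair:
    prompt: str
    chosen: str    # weak-model draft that led to the better final answer
    rejected: str  # weak-model draft that led to the worse final answer


def build_preference_pair(
    prompt: str,
    drafts: List[str],
    refine: Callable[[str, str], str],   # assumed stand-in for the strong model
    score: Callable[[str, str], float],  # assumed quality metric on the final answer
    margin: float = 0.0,
) -> Optional[PreferencePair]:
    """Rank weak-model drafts by the score of the strong model's refined output."""
    if len(drafts) < 2:
        return None
    scored = [(score(prompt, refine(prompt, d)), d) for d in drafts]
    best = max(scored, key=lambda item: item[0])
    worst = min(scored, key=lambda item: item[0])
    # Skip prompts where the drafts do not change the outcome enough to matter.
    if best[0] - worst[0] <= margin:
        return None
    return PreferencePair(prompt=prompt, chosen=best[1], rejected=worst[1])


# Toy stand-ins so the sketch runs without any model calls.
def toy_refine(prompt: str, draft: str) -> str:
    return draft.strip().lower()


def toy_score(prompt: str, answer: str) -> float:
    reference = {"aspirin", "inhibits", "cox"}
    return len(reference & set(answer.split())) / len(reference)


drafts = ["Aspirin inhibits COX", "Aspirin is a drug", "No idea"]
pair = build_preference_pair("How does aspirin work?", drafts, toy_refine, toy_score)
```

Pairs in this prompt/chosen/rejected shape are the usual input to preference-tuning methods. Which tuning objective the authors use is not stated in the abstract.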
Similar Papers
Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks
Machine Learning (CS)
Smaller AI helps big AI learn faster.
A Survey on Collaborative Mechanisms Between Large and Small Language Models
Artificial Intelligence
Makes smart AI work on phones and less powerful devices.
Collaboration among Multiple Large Language Models for Medical Question Answering
Computation and Language
Multiple AI doctors solve harder medical questions.