Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models
By: Jingbo Wang, Sendong Zhao, Jiatong Liu and more
Potential Business Impact:
Smarter AI agents use less computer power.
While multi-agent systems (MAS) have demonstrated superior performance over single-agent approaches in complex reasoning tasks, they often suffer from significant computational inefficiencies. Existing frameworks typically deploy large language models (LLMs) uniformly across all agent roles, failing to account for the varying cognitive demands of different reasoning stages. We address this inefficiency by proposing OI-MAS, a novel multi-agent framework that implements an adaptive model-selection policy across a heterogeneous pool of multi-scale LLMs. Specifically, OI-MAS introduces a state-dependent routing mechanism that dynamically selects agent roles and model scales throughout the reasoning process. In addition, we introduce a confidence-aware mechanism that selects appropriate model scales conditioned on task complexity, thus reducing unnecessary reliance on large-scale models. Experimental results show that OI-MAS consistently outperforms baseline multi-agent systems, improving accuracy by up to 12.88% while reducing cost by up to 79.78%.
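To make the idea of confidence-aware model selection concrete, here is a minimal Python sketch. It is not the OI-MAS routing policy, whose details the abstract does not give. It is a simple cheapest-first cascade, swapped in for illustration, that escalates to a larger model only when a smaller one reports low confidence. The `Model` class, the `route` function, the stub models, the cost values, and the 0.8 threshold are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Model:
    """Hypothetical stand-in for one LLM in a multi-scale pool."""
    name: str
    cost: float  # assumed cost per call, arbitrary units
    # Returns (answer, self-reported confidence in [0, 1]); an assumption,
    # since the abstract does not say how confidence is estimated.
    answer: Callable[[str], Tuple[str, float]]


def route(query: str, pool: List[Model], threshold: float = 0.8) -> Tuple[str, str, float]:
    """Query models cheapest first; stop at the first confident answer.

    Returns (answer, name of the model that produced it, total cost spent).
    If no model reaches the threshold, the most expensive model's answer is kept.
    """
    if not pool:
        raise ValueError("model pool is empty")
    total_cost = 0.0
    for model in sorted(pool, key=lambda m: m.cost):
        text, confidence = model.answer(query)
        total_cost += model.cost
        if confidence >= threshold:
            break
    return text, model.name, total_cost


# Stub models for illustration only: the small one is confident on short queries.
def small_stub(query: str) -> Tuple[str, float]:
    return ("4", 0.95) if len(query) < 20 else ("unsure", 0.30)


def large_stub(query: str) -> Tuple[str, float]:
    return ("detailed answer", 0.90)


POOL = [Model("small", 1.0, small_stub), Model("large", 10.0, large_stub)]

easy = route("2 + 2 = ?", POOL)
hard = route("a long multi-step reasoning question", POOL)
```

In this sketch the easy query is answered by the small model alone, while the hard query pays for both calls before settling on the large model's answer. A learned, state-dependent router of the kind the abstract describes would instead aim to pick the model scale up front.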
Similar Papers
Towards a Science of Scaling Agent Systems
Artificial Intelligence
Makes AI agents work better together.
Towards Generalized Routing: Model and Agent Orchestration for Adaptive and Efficient Inference
Multiagent Systems
Directs AI questions to the best tool.