FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration
By: Dongwon Jung, Peng Shi, Yi Zhang
Potential Business Impact:
Helps AI teams work together better, faster.
Scaling test-time computation improves large language model performance without additional training. Recent work demonstrates that techniques such as repeated sampling, self-verification, and self-reflection can significantly enhance task success by allocating more inference-time compute. However, applying these techniques across multiple agents in a multi-agent system is difficult: there are no principled mechanisms to allocate compute to foster collaboration among agents, to extend test-time scaling to collaborative interactions, or to distribute compute across agents under explicit budget constraints. To address this gap, we propose FutureWeaver, a framework for planning and optimizing test-time compute allocation in multi-agent systems under fixed budgets. FutureWeaver introduces modularized collaboration, formalized as callable functions that encapsulate reusable multi-agent workflows. These modules are automatically derived through self-play reflection by abstracting recurring interaction patterns from past trajectories. Building on these modules, FutureWeaver employs a dual-level planning architecture that optimizes compute allocation by reasoning over the current task state while also speculating on future steps. Experiments on complex agent benchmarks demonstrate that FutureWeaver consistently outperforms baselines across diverse budget settings, validating its effectiveness for inference-time optimization of multi-agent collaboration.
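As a rough illustration of the idea of collaboration modules as callable functions chosen under a fixed budget, the sketch below wraps toy workflows in a small class and selects them greedily by estimated gain per unit of cost. This is not FutureWeaver's planner: the names `CollabModule`, `plan_greedy`, and `execute`, the cost units, and the gain estimates are all assumptions made for illustration, and the greedy rule is a simple stand-in for the dual-level planning the abstract describes.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class CollabModule:
    """Hypothetical wrapper: a reusable multi-agent workflow as a callable."""
    name: str
    cost: int                  # assumed compute units per call (must be > 0)
    est_gain: float            # assumed benefit estimate, not taken from the paper
    run: Callable[[str], str]  # stand-in for the actual agent interaction


def plan_greedy(modules: List[CollabModule], budget: int) -> List[CollabModule]:
    """Pick modules by estimated gain per unit cost while they fit the budget."""
    ranked = sorted(modules, key=lambda m: m.est_gain / m.cost, reverse=True)
    plan: List[CollabModule] = []
    remaining = budget
    for module in ranked:
        if module.cost <= remaining:
            plan.append(module)
            remaining -= module.cost
    return plan


def execute(plan: List[CollabModule], state: str) -> str:
    """Run the planned modules in order, threading the task state through."""
    for module in plan:
        state = module.run(state)
    return state


# Toy modules; the string transforms stand in for real multi-agent workflows.
MODULES = [
    CollabModule("draft_and_verify", 3, 0.60, lambda s: s + " | verified"),
    CollabModule("sample_and_vote", 5, 0.75, lambda s: s + " | voted"),
    CollabModule("reflect", 2, 0.20, lambda s: s + " | reflected"),
]
```

With a budget of 6 units, `plan_greedy(MODULES, 6)` selects `draft_and_verify` first, skips `sample_and_vote` because it no longer fits, and then adds `reflect`. A greedy ratio rule like this looks only at the current step, which is the limitation that planning over future steps is meant to address.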
Similar Papers
Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents
Artificial Intelligence
Helps AI decide when to think ahead for tasks.
Scaling Test-time Compute for LLM Agents
Artificial Intelligence
Makes AI agents smarter by letting them think more.
AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks
Artificial Intelligence
Boosts AI for multi-step complex tasks.