Score: 2

Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making

Published: December 9, 2025 | arXiv ID: 2512.08366v1

By: Wentao Zhang, Qunbo Wang, Tao Zhang and more

Potential Business Impact:

Helps AI solve problems by thinking differently.

Business Areas:

Semantic Search, Internet Services

Large language model (LLM) agents often rely on external demonstrations or retrieval-augmented planning, leading to brittleness, poor generalization, and high computational overhead. Inspired by human problem-solving, we propose DuSAR (Dual-Strategy Agent with Reflecting), a demonstration-free framework that enables a single frozen LLM to perform co-adaptive reasoning via two complementary strategies: a high-level holistic plan and a context-grounded local policy. These strategies interact through a lightweight reflection mechanism: the agent continuously assesses progress via a Strategy Fitness Score, revising its global plan when stuck and refining it upon meaningful advancement, which mimics human metacognitive behavior. On ALFWorld and Mind2Web, DuSAR achieves state-of-the-art performance with open-source LLMs (7B-70B), reaching 37.1% success on ALFWorld (Llama3.1-70B), more than doubling the best prior result (13.0%), and 4.02% on Mind2Web, also more than doubling the strongest baseline. It also reduces per-step token consumption by 3-9x while maintaining strong performance. Ablation studies confirm the necessity of dual-strategy coordination. Moreover, optional integration of expert demonstrations further boosts results, highlighting DuSAR's flexibility and compatibility with external knowledge.
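
The reflection mechanism described in the abstract can be pictured with a small sketch. This is an illustration only, not the authors' implementation: the abstract does not say how the Strategy Fitness Score is computed or what triggers a revision, so the score range (0 to 1), the two thresholds, and the names `DualStrategyState`, `reflect`, `revise`, and `refine` are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class DualStrategyState:
    """Hypothetical container for the agent's high-level plan."""
    plan: str                                            # holistic plan text
    scores: List[float] = field(default_factory=list)    # fitness history


def reflect(
    state: DualStrategyState,
    fitness: float,
    revise: Callable[[str], str],
    refine: Callable[[str], str],
    stuck_below: float = 0.3,     # assumed threshold, not from the paper
    advance_above: float = 0.7,   # assumed threshold, not from the paper
) -> str:
    """Record one fitness score, then revise, refine, or keep the plan."""
    state.scores.append(fitness)
    if fitness < stuck_below:
        # Agent looks stuck: replace the global plan.
        state.plan = revise(state.plan)
        return "revise"
    if fitness > advance_above:
        # Meaningful progress: sharpen the existing plan.
        state.plan = refine(state.plan)
        return "refine"
    # Otherwise leave the plan unchanged and let the local policy continue.
    return "keep"


if __name__ == "__main__":
    state = DualStrategyState(plan="find mug")
    # In a real agent, revise/refine would be calls to the frozen LLM;
    # here they are stand-in string functions.
    for score in (0.1, 0.9, 0.5):
        action = reflect(
            state,
            score,
            revise=lambda p: p + " [revised]",
            refine=lambda p: p + " [refined]",
        )
        print(score, action, state.plan)
```

In this sketch the local policy is left out entirely; the point is only the control flow in which a single score decides whether the global plan is rewritten, sharpened, or left alone.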

MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning

Artificial Intelligence

Helps computers think faster and learn new things.

6 Oct 2025 1

88%

DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal

Computation and Language

Helps computers write better code, faster.

18 Mar 2025 3

87%

Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making

Artificial Intelligence

Helps computers make smarter money choices with words.

17 Nov 2025 1


Country of Origin

🇸🇬 🇨🇳 Singapore, China

Page Count

16 pages
