Score: 0

Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Published: November 26, 2025 | arXiv ID: 2511.20993v1

By: Shanwei Fan

Potential Business Impact:

Helps robots follow plans by checking steps.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Large language models (LLMs) offer strong high-level planning capabilities for reinforcement learning (RL) by decomposing tasks into subgoals. However, their practical utility is limited by poor planning-execution alignment, which reflects a critical gap between abstract plans and actionable, environment-compatible behaviors. This misalignment arises from two interrelated limitations: (1) LLMs often produce subgoals that are semantically plausible but infeasible or irrelevant in the target environment due to insufficient grounding in environment-specific knowledge, and (2) single-LLM planning conflates generation with self-verification, resulting in overconfident yet unreliable subgoals that frequently fail during execution. To address these challenges, we propose Subgoal Graph-Augmented Actor-Critic-Refiner (SGA-ACR), a framework that integrates an environment-specific subgoal graph and structured entity knowledge with a multi-LLM planning pipeline that explicitly separates generation, critique, and refinement to produce executable and verifiable subgoals. A subgoal tracker further monitors execution progress, provides auxiliary rewards, and adaptively updates the subgoal graph to maintain alignment between plans and actions. Experimental results on 22 diverse tasks in the open-world game "Crafter" demonstrate the effectiveness of our proposed method.

Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model

Computation and Language

Helps robots plan long tasks by remembering past actions.

29 Jan 2026 1

90%

Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

Machine Learning (CS)

Teaches computers to solve math problems better.

8 Jan 2026 2

90%

SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments

Artificial Intelligence

Teaches computers to plan faster without asking experts.

10 Dec 2025 0

View PDF Login to Bookmark

Page Count

21 pages

Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Helps robots follow plans by checking steps.

Technical Abstract

Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model

Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments