Embodied Task Planning via Graph-Informed Action Generation with Large Language Model

Published: January 29, 2026 | arXiv ID: 2601.21841v1

By: Xiang Li, Ning Yan, Masood Mortazavi

Potential Business Impact:

Helps robots plan long tasks by remembering past actions.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

While Large Language Models (LLMs) have demonstrated strong zero-shot reasoning capabilities, their deployment as embodied agents still faces fundamental challenges in long-horizon planning. Unlike open-ended text generation, embodied agents must decompose high-level intent into actionable sub-goals while strictly adhering to the logic of a dynamic, observed environment. Standard LLM planners frequently fail to maintain a coherent strategy over extended horizons because of context window limitations, or they hallucinate transitions that violate constraints. We propose GiG, a novel planning framework that structures embodied agents' memory using a Graph-in-Graph architecture. Our approach employs a Graph Neural Network (GNN) to encode environmental states into embeddings and organizes these embeddings into action-connected execution trace graphs within an experience memory bank. By clustering these graph embeddings, the framework enables retrieval of structure-aware priors, allowing agents to ground current decisions in relevant past structural patterns. Furthermore, we introduce a novel bounded lookahead module that leverages symbolic transition logic to enhance the agents' planning capabilities through grounded action projection. We evaluate our framework on three embodied planning benchmarks: Robotouille Synchronous, Robotouille Asynchronous, and ALFWorld. Our method outperforms state-of-the-art baselines, achieving Pass@1 performance gains of up to 22% on Robotouille Synchronous, 37% on Robotouille Asynchronous, and 15% on ALFWorld at comparable or lower computational cost.
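
The abstract does not say how the bounded lookahead module is implemented, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes STRIPS-style symbolic actions (preconditions, added facts, deleted facts) and uses a depth-limited breadth-first search to project which action sequences are actually valid from the current state. Every name here (`Action`, `bounded_lookahead`, the kitchen facts) is hypothetical and chosen for illustration.

```python
from collections import deque
from typing import FrozenSet, List, NamedTuple, Optional, Sequence

State = FrozenSet[str]  # a state is a set of symbolic facts (assumption)


class Action(NamedTuple):
    """Hypothetical STRIPS-style action: preconditions and effects as fact sets."""
    name: str
    preconditions: FrozenSet[str]
    add: FrozenSet[str]
    delete: FrozenSet[str]


def apply_action(state: State, action: Action) -> State:
    """Apply the symbolic transition: remove deleted facts, then add new ones."""
    return (state - action.delete) | action.add


def bounded_lookahead(
    state: State, actions: Sequence[Action], goal: FrozenSet[str], depth: int
) -> Optional[List[str]]:
    """Return a shortest valid action sequence reaching `goal` within `depth`
    steps, or None if no such sequence exists inside the bound."""
    if goal <= state:
        return []
    frontier = deque([(state, [])])
    seen = {state}
    while frontier:
        current, plan = frontier.popleft()
        if len(plan) == depth:
            continue  # lookahead bound reached; do not expand further
        for action in actions:
            if not action.preconditions <= current:
                continue  # transition would violate a precondition
            nxt = apply_action(current, action)
            if nxt in seen:
                continue
            new_plan = plan + [action.name]
            if goal <= nxt:
                return new_plan
            seen.add(nxt)
            frontier.append((nxt, new_plan))
    return None


# Toy kitchen domain, loosely in the spirit of a cooking benchmark (made up).
ACTIONS = [
    Action("pick_up_patty",
           frozenset({"patty_on_table", "hand_empty"}),
           frozenset({"holding_patty"}),
           frozenset({"patty_on_table", "hand_empty"})),
    Action("place_on_stove",
           frozenset({"holding_patty"}),
           frozenset({"patty_on_stove", "hand_empty"}),
           frozenset({"holding_patty"})),
    Action("cook_patty",
           frozenset({"patty_on_stove"}),
           frozenset({"patty_cooked"}),
           frozenset()),
]

START: State = frozenset({"patty_on_table", "hand_empty"})
GOAL = frozenset({"patty_cooked"})

plan_within_3 = bounded_lookahead(START, ACTIONS, GOAL, depth=3)
plan_within_2 = bounded_lookahead(START, ACTIONS, GOAL, depth=2)
print(plan_within_3)
print(plan_within_2)
```

In a planner of this kind, the projected sequences (or the set of currently applicable actions) could be handed to the LLM as grounded candidates, so that it chooses among transitions the environment logic permits instead of proposing invalid ones. How GiG combines this projection with its retrieved graph priors is not described in the abstract.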

EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence

CV and Pattern Recognition

Robots learn to do tasks in the real world.

23 Oct 2025 1

91%

LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation

Robotics

Robot learns to do tasks even when things change.

24 Dec 2025 0

91%

Plan Verification for LLM-Based Embodied Task Completion Agents

Artificial Intelligence

Makes robots learn better by fixing their mistakes.

2 Sep 2025 2

Page Count

20 pages
