Meta-Optimization and Program Search using Language Models for Task and Motion Planning
By: Denis Shcherba , Eckart Cobo-Briesewitz , Cornelius V. Braun and more
Potential Business Impact:
Robots learn to do tasks better by thinking and moving.
Intelligent interaction with the real world requires robotic agents to jointly reason over high-level plans and low-level controls. Task and motion planning (TAMP) addresses this by combining symbolic planning and continuous trajectory generation. Recently, foundation model approaches to TAMP have presented impressive results, including fast planning times and the execution of natural language instructions. Yet, the optimal interface between high-level planning and low-level motion generation remains an open question: prior approaches are limited by either too much abstraction (e.g., chaining simplified skill primitives) or a lack thereof (e.g., direct joint angle prediction). Our method introduces a novel technique employing a form of meta-optimization to address these issues by: (i) using program search over trajectory optimization problems as an interface between a foundation model and robot control, and (ii) leveraging a zero-order method to optimize numerical parameters in the foundation model output. Results on challenging object manipulation and drawing tasks confirm that our proposed method improves over prior TAMP approaches.
Similar Papers
Hierarchical Temporal Logic Task and Motion Planning for Multi-Robot Systems
Robotics
Robots work together to finish jobs faster.
LLM-GROP: Visually Grounded Robot Task and Motion Planning with Large Language Models
Robotics
Robot learns to set tables using common sense.
Task and Motion Planning for Humanoid Loco-manipulation
Robotics
Robots can now walk and grab things together.