Zero-Shot Instruction Following in RL via Structured LTL Representations
By: Mattia Giuri, Mathias Jackermeier, Alessandro Abate
Potential Business Impact:
Teaches robots to follow complex, multi-step instructions.
Linear temporal logic (LTL) is a compelling framework for specifying complex, structured tasks for reinforcement learning (RL) agents. Recent work has shown that interpreting LTL instructions as finite automata, which can be seen as high-level programs monitoring task progress, enables learning a single generalist policy capable of executing arbitrary instructions at test time. However, existing approaches fall short in environments where multiple high-level events (i.e., atomic propositions) can be true at the same time and potentially interact in complicated ways. In this work, we propose a novel approach to learning a multi-task policy for following arbitrary LTL instructions that addresses this shortcoming. Our method conditions the policy on sequences of simple Boolean formulae, which directly align with transitions in the automaton, and are encoded via a graph neural network (GNN) to yield structured task representations. Experiments in a complex chess-based environment demonstrate the advantages of our approach.
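To make the described architecture concrete, below is a minimal sketch (not the authors' implementation) of the encoding step in Python/PyTorch: a Boolean guard on an automaton transition is parsed into a syntax-tree graph, and a small message-passing network produces an embedding for it. The formula encoding, the `formula_to_graph` helper, and the `FormulaGNN` module are hypothetical names invented for illustration; the paper's actual encoder, formula syntax, and aggregation scheme may differ.

```python
# Hypothetical sketch: encode an automaton transition's Boolean guard
# (a formula over atomic propositions) as a syntax-tree graph, then
# embed it with a few rounds of message passing.
import torch
import torch.nn as nn

# Node types for a tiny Boolean-formula AST: operators plus atomic propositions.
NODE_TYPES = {"AND": 0, "OR": 1, "NOT": 2, "a": 3, "b": 4, "c": 5}

def formula_to_graph(formula):
    """Flatten a nested tuple formula, e.g. ("AND", "a", ("NOT", "b")),
    into node-type indices and directed child->parent edges."""
    nodes, edges = [], []

    def visit(node):
        idx = len(nodes)
        if isinstance(node, tuple):  # operator node with children
            nodes.append(NODE_TYPES[node[0]])
            for child in node[1:]:
                child_idx = visit(child)
                edges.append((child_idx, idx))
        else:  # leaf: atomic proposition
            nodes.append(NODE_TYPES[node])
        return idx

    visit(formula)
    return torch.tensor(nodes), torch.tensor(edges, dtype=torch.long)

class FormulaGNN(nn.Module):
    """Sum-aggregation message passing over the formula syntax tree;
    the mean of the final node embeddings is the task representation."""

    def __init__(self, dim=32, steps=3):
        super().__init__()
        self.embed = nn.Embedding(len(NODE_TYPES), dim)
        self.update = nn.GRUCell(dim, dim)
        self.steps = steps

    def forward(self, node_types, edges):
        h = self.embed(node_types)
        for _ in range(self.steps):
            msg = torch.zeros_like(h)
            if edges.numel() > 0:
                # Aggregate messages from children (src) into parents (dst).
                src, dst = edges[:, 0], edges[:, 1]
                msg = msg.index_add(0, dst, h[src])
            h = self.update(msg, h)
        return h.mean(dim=0)  # one vector per transition formula

# Example: the guard "a AND (NOT b)" on a single automaton transition.
guard = ("AND", "a", ("NOT", "b"))
node_types, edges = formula_to_graph(guard)
task_embedding = FormulaGNN()(node_types, edges)
print(task_embedding.shape)  # torch.Size([32])
```

In this reading of the abstract, the policy would be conditioned on a sequence of such per-transition embeddings rather than on a single monolithic formula embedding, which is what lets the task representation align directly with transitions in the automaton.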
Similar Papers
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
Artificial Intelligence
Helps robots learn complex tasks without practice.
Logic-based Task Representation and Reward Shaping in Multiagent Reinforcement Learning
Multiagent Systems
Teaches robots to work together faster.
LTLf Learning Meets Boolean Set Cover
Artificial Intelligence
Learns logical rules from examples much faster.