Information-Theoretic Policy Pre-Training with Empowerment
By: Moritz Schneider , Robert Krug , Narunas Vaskevicius and more
Potential Business Impact:
Teaches robots to learn faster and better.
Empowerment, an information-theoretic measure of an agent's potential influence on its environment, has emerged as a powerful intrinsic motivation and exploration framework for reinforcement learning (RL). Besides for unsupervised RL and skill learning algorithms, the specific use of empowerment as a pre-training signal has received limited attention in the literature. We show that empowerment can be used as a pre-training signal for data-efficient downstream task adaptation. For this we extend the traditional notion of empowerment by introducing discounted empowerment, which balances the agent's control over the environment across short- and long-term horizons. Leveraging this formulation, we propose a novel pre-training paradigm that initializes policies to maximize discounted empowerment, enabling agents to acquire a robust understanding of environmental dynamics. We analyze empowerment-based pre-training for various existing RL algorithms and empirically demonstrate its potential as a general-purpose initialization strategy: empowerment-maximizing policies with long horizons are data-efficient and effective, leading to improved adaptability in downstream tasks. Our findings pave the way for future research to scale this framework to high-dimensional and complex tasks, further advancing the field of RL.
Similar Papers
When Empowerment Disempowers
Artificial Intelligence
AI helping one person can hurt another.
Training LLM Agents to Empower Humans
Artificial Intelligence
Helps computers let people finish tasks faster.
Training LLM Agents to Empower Humans
Artificial Intelligence
Helps computers help people make better choices.