Score: 0

D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection

Published: May 4, 2025 | arXiv ID: 2505.01979v1

By: Chenran Zhao , Dianxi Shi , Mengzhu Wang and more

Potential Business Impact:

Teaches robots to make better choices by understanding cause and effect.

Business Areas:

Virtual Reality Hardware, Software

Current Hierarchical Reinforcement Learning (HRL) algorithms excel in long-horizon sequential decision-making tasks but still face two challenges: delay effects and spurious correlations. To address them, we propose a causal HRL approach called D3HRL. First, D3HRL models delayed effects as causal relationships across different time spans and employs distributed causal discovery to learn these relationships. Second, it employs conditional independence testing to eliminate spurious correlations. Finally, D3HRL constructs and trains hierarchical policies based on the identified true causal relationships. These three steps are iteratively executed, gradually exploring the complete causal chain of the task. Experiments conducted in 2D-MineCraft and MiniGrid show that D3HRL demonstrates superior sensitivity to delay effects and accurately identifies causal relationships, leading to reliable decision-making in complex environments.

Hierarchical Reinforcement Learning with Targeted Causal Interventions

Machine Learning (CS)

Teaches robots to learn tasks faster.

6 Jul 2025 1

88%

Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning

Machine Learning (CS)

Teaches cars to drive safely in tricky traffic.

25 Jan 2025 0

88%

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning

Artificial Intelligence

Teaches computers to learn and plan better.

16 Jun 2025 2

View PDF Login to Bookmark

Page Count

26 pages

D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection

Teaches robots to make better choices by understanding cause and effect.

Technical Abstract

Hierarchical Reinforcement Learning with Targeted Causal Interventions

Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning