Score: 2

Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation

Published: December 10, 2025 | arXiv ID: 2512.10099v1

By: Steven Caro, Stephen L. Smith

Potential Business Impact:

Robots learn to push objects through cluttered spaces.

Business Areas:
Robotics Hardware, Science and Engineering, Software

Nonprehensile manipulation, such as pushing objects across cluttered environments, presents a challenging control problem due to complex contact dynamics and long-horizon planning requirements. In this work, we propose HeRD, a hierarchical reinforcement learning-diffusion policy that decomposes pushing tasks into two levels: high-level goal selection and low-level trajectory generation. We employ a high-level reinforcement learning (RL) agent to select intermediate spatial goals, and a low-level goal-conditioned diffusion model to generate feasible, efficient trajectories to reach them. This architecture combines the long-term reward-maximizing behaviour of RL with the generative capabilities of diffusion models. We evaluate our method in a 2D simulation environment and show that it outperforms the state-of-the-art baseline in success rate, path efficiency, and generalization across multiple environment configurations. Our results suggest that hierarchical control with generative low-level planning is a promising direction for scalable, goal-directed nonprehensile manipulation. Code, documentation, and trained models are available at https://github.com/carosteven/HeRD.
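
To make the two-level decomposition concrete, below is a minimal Python sketch of how such a hierarchical control loop could be wired together. All names here (Toy2DPushEnv, HighLevelRLAgent, GoalConditionedDiffusionPolicy, herd_style_control_loop) are illustrative assumptions, not the authors' API: the placeholder subgoal rule and linear-interpolation "trajectory" stand in for the trained RL agent and diffusion model described in the abstract.

import numpy as np


class Toy2DPushEnv:
    """Minimal 2D point-goal environment standing in for the paper's pushing sim."""

    def __init__(self, goal=(8.0, 8.0)):
        self.goal = np.array(goal, dtype=float)
        self.state = np.zeros(2)

    def reset(self):
        self.state = np.zeros(2)
        return np.concatenate([self.state, self.goal])

    def step(self, waypoint):
        self.state = np.asarray(waypoint, dtype=float)
        dist = np.linalg.norm(self.state - self.goal)
        obs = np.concatenate([self.state, self.goal])
        return obs, -dist, dist < 0.1, {}


class HighLevelRLAgent:
    """High level: selects an intermediate spatial goal from the current observation."""

    def select_subgoal(self, obs):
        state, goal = obs[:2], obs[2:]
        # Placeholder: step a fixed fraction toward the final goal. The paper
        # instead trains an RL agent to pick reward-maximizing subgoals.
        return state + 0.25 * (goal - state)


class GoalConditionedDiffusionPolicy:
    """Low level: generates a short waypoint trajectory toward the subgoal."""

    def generate_trajectory(self, state, subgoal, horizon=8):
        # Placeholder: linear interpolation standing in for the reverse
        # diffusion process that would denoise a feasible trajectory.
        alphas = np.linspace(0.0, 1.0, horizon)[1:, None]
        return state + alphas * (subgoal - state)


def herd_style_control_loop(env, hi, lo, max_subgoals=50):
    """Alternate between subgoal selection and low-level trajectory execution."""
    obs = env.reset()
    for _ in range(max_subgoals):
        subgoal = hi.select_subgoal(obs)
        for waypoint in lo.generate_trajectory(obs[:2], subgoal):
            obs, reward, done, _ = env.step(waypoint)
            if done:
                return obs, True
    return obs, False


if __name__ == "__main__":
    final_obs, reached = herd_style_control_loop(
        Toy2DPushEnv(), HighLevelRLAgent(), GoalConditionedDiffusionPolicy()
    )
    print("reached goal:", reached, "| final state:", final_obs[:2])

The structural point the sketch illustrates is the division of labour: the high-level agent reasons over sparse subgoals and long-horizon reward, while the low-level generative policy fills in dense, feasible waypoints between consecutive subgoals.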

Country of Origin
🇨🇦 Canada

Repos / Data Links
https://github.com/carosteven/HeRD

Page Count
8 pages

Category
Computer Science: Robotics