Stein-based Optimization of Sampling Distributions in Model Predictive Path Integral Control
By: Jace Aldrich, Odest Chadwicke Jenkins
Potential Business Impact:
Makes robots learn to walk and move better.
This paper presents a novel method for Model Predictive Path Integral (MPPI) control that optimizes sample generation towards an optimal trajectory through Stein Variational Gradient Descent (SVGD). MPPI is traditionally reliant on randomly sampled trajectories, often by a Gaussian distribution. The result can lead to sample deprivation, under-representing the space of possible trajectories, and yield suboptimal results. Through introducing SVGD updates in between MPPI environment steps, we present Stein-Optimized Path-Integral Inference (SOPPI), an MPPI/SVGD algorithm that can dynamically update noise distributions at runtime to shape a more optimal representation without an excessive increase in computational requirements. We demonstrate the efficacy of our method systems ranging from a Cart-Pole to a two-dimensional bipedal walking task, indicating improved performance above standard MPPI across a range of hyper-parameters and demonstrate feasibility at lower particle counts. We discuss the applicability of this MPPI/SVGD method to higher degree-of-freedom systems, as well as its potential to new developments in state-of-the-art differentiable simulators.
Similar Papers
Gauss-Newton accelerated MPPI Control
Systems and Control
Makes robots move better in tricky situations.
Geometric Model Predictive Path Integral for Agile UAV Control with Online Collision Avoidance
Robotics
Drones fly fast, avoid trees, and follow paths.
One-Step Model Predictive Path Integral for Manipulator Motion Planning Using Configuration Space Distance Fields
Robotics
Robots move safely and fast around obstacles.