Policy Optimization in Robust Control: Weak Convexity and Subgradient Methods
By: Yuto Watanabe, Feng-Yi Liao, Yang Zheng
Potential Business Impact:
Makes robots smarter and more reliable.
Robust control seeks stabilizing policies that perform reliably under adversarial disturbances, with $\mathcal{H}_\infty$ control as a classical formulation. It is known that policy optimization of robust $\mathcal{H}_\infty$ control naturally lead to nonsmooth and nonconvex problems. This paper builds on recent advances in nonsmooth optimization to analyze discrete-time static output-feedback $\mathcal{H}_\infty$ control. We show that the $\mathcal{H}_\infty$ cost is weakly convex over any convex subset of a sublevel set. This structural property allows us to establish the first non-asymptotic deterministic convergence rate for the subgradient method under suitable assumptions. In addition, we prove a weak Polyak-{\L}ojasiewicz (PL) inequality in the state-feedback case, implying that all stationary points are globally optimal. We finally present a few numerical examples to validate the theoretical results.
Similar Papers
Beyond Quadratic Costs: A Bregman Divergence Approach to H$_\infty$ Control
Systems and Control
Makes robots safer and more efficient.
Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity
Optimization and Control
Finds the best solution even when it's hard.
Optimal $\mathbb{H}_2$ Control with Passivity-Constrained Feedback: Convex Approach
Optimization and Control
Makes robots move smoothly and stop vibrations.