Score: 0

On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances

Published: September 20, 2025 | arXiv ID: 2509.16746v2

By: Sayak Mukherjee, Ramij R. Hossain, Mahantesh Halappanavar

Potential Business Impact:

Helps robots learn to control things even with surprises.

Business Areas:
Embedded Systems Hardware, Science and Engineering, Software

We analyze offline designs of linear quadratic regulator (LQR) strategies with uncertain disturbances. First, we consider the scenario where the exogenous variable can be estimated in a controlled environment, and subsequently, consider a more practical and challenging scenario where it is unknown in a stochastic setting. Our approach builds on the fundamental learning-based framework of adaptive dynamic programming (ADP), combined with a Lyapunov-based analytical methodology to design the algorithms and derive sample-based approximations motivated from the Markov decision process (MDP)-based approaches. For the scenario involving non-measurable disturbances, we further establish stability and convergence guarantees for the learned control gains under sample-based approximations. The overall methodology emphasizes simplicity while providing rigorous guarantees. Finally, numerical experiments focus on the intricacies and validations for the design of offline continuous-time LQR with exogenous disturbances.

Page Count
17 pages

Category
Electrical Engineering and Systems Science:
Systems and Control