Mission-Aligned Learning-Informed Control of Autonomous Systems: Formulation and Foundations
By: Vyacheslav Kungurtsev, Gustav Sir, Akhil Anand and more
Potential Business Impact:
Robots learn to do tasks safely and reliably.
Research, innovation, and practical capital investment toward the realization of autonomous physical agents have been increasing rapidly. These agents include industrial and service robots, unmanned aerial vehicles, embedded control devices, and a number of other cybernetic/mechatronic implementations of intelligent autonomous devices. In this paper, we consider a stylized version of robotic care, which would normally involve a two-level Reinforcement Learning (RL) procedure that trains a policy for both lower-level physical movement decisions and higher-level conceptual tasks and their sub-components. To deliver greater safety and reliability, we present a general formulation of this problem as a two-level optimization scheme that incorporates control at the lower level and classical planning at the higher level, integrated with a capacity for learning. This synergistic integration of multiple methodologies (control, classical planning, and RL) presents an opportunity for greater insight into algorithm development, leading to more efficient and reliable performance. Here, reliability pertains to physical safety and to the interpretability of the otherwise black-box operation of autonomous agents, which are of concern to users and regulators. This work presents the necessary background and the general formulation of the optimization framework, detailing each component and its integration with the others.