Score: 0

Deep Reinforcement Learning Policies for Underactuated Satellite Attitude Control

Published: April 30, 2025 | arXiv ID: 2505.00165v1

By: Matteo El Hariry , Andrea Cini , Giacomo Mellone and more

Potential Business Impact:

Teaches satellites to point themselves in space.

Business Areas:
Autonomous Vehicles Transportation

Autonomy is a key challenge for future space exploration endeavours. Deep Reinforcement Learning holds the promises for developing agents able to learn complex behaviours simply by interacting with their environment. This paper investigates the use of Reinforcement Learning for the satellite attitude control problem, namely the angular reorientation of a spacecraft with respect to an in- ertial frame of reference. In the proposed approach, a set of control policies are implemented as neural networks trained with a custom version of the Proximal Policy Optimization algorithm to maneuver a small satellite from a random starting angle to a given pointing target. In particular, we address the problem for two working conditions: the nominal case, in which all the actuators (a set of 3 reac- tion wheels) are working properly, and the underactuated case, where an actuator failure is simulated randomly along with one of the axes. We show that the agents learn to effectively perform large-angle slew maneuvers with fast convergence and industry-standard pointing accuracy. Furthermore, we test the proposed method on representative hardware, showing that by taking adequate measures controllers trained in simulation can perform well in real systems.

Page Count
11 pages

Category
Computer Science:
Robotics