A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
By: Bofei Liu , Dong Ye , Zunhao Yao and more
Potential Business Impact:
Satellites change shape to do many jobs.
Modular self-reconfigurable satellites refer to satellite clusters composed of individual modular units capable of altering their configurations. The configuration changes enable the execution of diverse tasks and mission objectives. Existing path planning algorithms for reconfiguration often suffer from high computational complexity, poor generalization capability, and limited support for diverse target configurations. To address these challenges, this paper proposes a goal-oriented reinforcement learning-based path planning algorithm. This algorithm is the first to address the challenge that previous reinforcement learning methods failed to overcome, namely handling multiple target configurations. Moreover, techniques such as Hindsight Experience Replay and Invalid Action Masking are incorporated to overcome the significant obstacles posed by sparse rewards and invalid actions. Based on these designs, our model achieves a 95% and 73% success rate in reaching arbitrary target configurations in a modular satellite cluster composed of four and six units, respectively.
Similar Papers
Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study
Artificial Intelligence
Lets satellites work together to watch Earth.
On-board Mission Replanning for Adaptive Cooperative Multi-Robot Systems
Robotics
Robots plan better, faster, together, without help.
Multi-Agent Reinforcement Learning for Heterogeneous Satellite Cluster Resources Optimization
Artificial Intelligence
Lets satellites work together to take better pictures.