Single-Agent attention actor-critic: A deep reinforcement learning-based solution for low-thrust spacecraft trajectory optimization

No Thumbnail Available
Authors
Zaidi, Syed Muhammad Talha
Arustei, Adrian
Munir, Arslan
Dutta, Atri
Advisors
Issue Date
2025-05-13
Type
Article
Keywords
Attention actor critic , Automated trajectory optimization , Cislunar transfers , Deep learning , Reinforcement learning
Research Projects
Organizational Units
Journal Issue
Citation
S. M. T. Zaidi, A. Arustei, A. Munir and A. Dutta, "Single-Agent Attention Actor-Critic: A Deep Reinforcement Learning-Based Solution for Low-Thrust Spacecraft Trajectory Optimization," in IEEE Transactions on Aerospace and Electronic Systems, doi: 10.1109/TAES.2025.3569615.
Abstract

This paper introduces a deep reinforcement learning (DRL) based approach for autonomous planning of low-thrust spacecraft trajectories, while addressing the intricate challenges of orbital dynamics and mission design. We propose a single-agent attention actor-critic (SA3C) algorithm, which integrates an attention mechanism to significantly enhance sample efficiency and decision-making capabilities in complex trajectory optimization tasks. Our research extends the application of DRL beyond traditional geocentric transfers, incorporating cislunar mission scenarios where strong third-body gravitational influences play a critical role. The SA3C algorithm provides better results compared to existing automated approaches, demonstrating its effectiveness in optimizing transfers to GEO and near-rectilinear halo orbit (NRHO). We offer a comprehensive comparison of three algorithmic frameworks—sequential, DRL-based, and optimization-based—in terms of optimality and potential for autonomy in spacecraft trajectory planning. Through rigorous evaluation, we demonstrate that attention-based modifications in RL enhance the adaptability and efficiency of low-thrust spacecraft trajectory planning, offering a promising avenue for autonomous and effective mission designs in multi-body gravitational environments. This work contributes toward advancing spacecraft autonomy and optimizing complex orbital maneuvers by introducing the SA3C algorithm, which demonstrates the potential to achieve near-optimal transfer times in geocentric and cislunar missions. © 1965-2011 IEEE.

Table of Contents
Description
Click on the DOI link to access this article at the publishers website (may not be free).
Publisher
Institute of Electrical and Electronics Engineers Inc.
Journal
IEEE Transactions on Aerospace and Electronic Systems
Book Title
Series
PubMed ID
ISSN
00189251
EISSN