OrbitZoo: Multi-Agent Reinforcement Learning Environment for Orbital Dynamics
Oliveira, Dyreby, Caldas et al.
The increasing number of satellites and orbital debris has made space congestion a critical issue, threatening satellite safety and sustainability. Challenges such as collision avoidance, station-keeping, and orbital maneuvering require advanced techniques to handle dynamic uncertainties and multi-agent interactions. Reinforcement learning (RL) has shown promise in this domain, enabling adaptive, autonomous policies for space operations; however, many existing RL frameworks rely on custom-built environments developed from scratch, which often use simplified models and require significant time to implement and validate the orbital dynamics, limiting their ability to fully capture real-world complexities. To address this, we introduce OrbitZoo, a versatile multi-agent RL environment built on a high-fidelity industry standard library, that enables realistic data generation, supports scenarios like collision avoidance and cooperative maneuvers, and ensures robust and accurate orbital dynamics. The environment is validated against a real satellite constellation, Starlink, achieving a Mean Absolute Percentage Error (MAPE) of 0.16% compared to real-world data. This validation ensures reliability for generating high-fidelity simulations and enabling autonomous and independent satellite operations.
academic
OrbitZoo: Multi-Agent Reinforcement Learning Environment for Orbital Dynamics
With the increasing number of satellites and orbital debris, space congestion has become a critical threat to satellite safety and sustainability. Challenges such as collision avoidance, station-keeping, and orbital maneuvers require advanced techniques to handle dynamic uncertainty and multi-agent interactions. Reinforcement Learning (RL) has shown promise in this domain, capable of providing adaptive and autonomous strategies for space operations; however, many existing RL frameworks rely on custom environments built from scratch, often using simplified models that require substantial time to implement and validate orbital dynamics, limiting their ability to fully capture real-world complexity. To address this issue, this paper introduces OrbitZoo, a versatile multi-agent RL environment built upon high-fidelity industry-standard libraries, enabling realistic data generation, supporting scenarios such as collision avoidance and cooperative maneuvers, and ensuring robust and accurate orbital dynamics. The environment has been validated against real Starlink satellite constellation data, achieving a mean absolute percentage error (MAPE) of 0.16% compared to real-world data.
Space Congestion Problem: Since 1957, approximately 20,000 satellites have been launched by humanity, with approximately 140 million debris objects currently in the orbital environment, of which approximately 1 million are larger than 1 centimeter, sufficient to cause catastrophic damage upon impact.
Kessler Syndrome Threat: Debris collisions generate more debris, creating a chain reaction that could render Earth's orbits unusable.
Limitations of Traditional Approaches: Current satellite maneuver solutions heavily rely on manual processes, which become unsustainable as the number of satellites and orbital debris continues to grow.
Automation Requirements: Need to develop faster and more capable autonomous intelligent decision-making systems.
RL Application Potential: RL demonstrates excellence in real-time adaptation to complex, dynamic, and nonlinear space systems.
Lack of Standardization: Existing RL frameworks lack standardization, with most based on simplified models that struggle to capture real-world complexity.
High-Fidelity Data Generation: Built on Python and powerful space dynamics libraries, integrating realistic forces and perturbations, providing accurate datasets that support parallel computation for fast propagation.
Multi-Agent Reinforcement Learning Support: A standardized RL research platform leveraging the PettingZoo library to support multi-agent RL with partially observable Markov decision process (POMDP) structures, scalable to systems with thousands of celestial bodies.
Customizable Framework and Visualization: Modular design allowing users to define arbitrary numbers of celestial body scenarios, integrate custom models, with clear abstraction layer separation, providing interactive 3D visualization components.
Real-World Validation: Through comparative validation with the Starlink satellite constellation, achieving 0.16% MAPE, ensuring the reliability of high-fidelity simulation.
Starlink ephemeris data: Satellite orbit validation data
Related orbital RL research: Kolosa (2019), Herrera (2020), Casas (2022), etc.
Summary: OrbitZoo is an open-source multi-agent reinforcement learning environment with significant academic and practical value. Through high-fidelity orbital dynamics modeling and real-world data validation, it provides a powerful tool for research and development of autonomous space systems. This work not only advances RL applications in aerospace but also makes important contributions to standardized development in this interdisciplinary field.