ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy
Mousist
This paper presents ASTREA, the first agentic system executed on flight-heritage hardware (TRL 9) for autonomous spacecraft operations, with on-orbit operation aboard the International Space Station (ISS). Using thermal control as a representative use case, we integrate a resource-constrained Large Language Model (LLM) agent with a reinforcement learning controller in an asynchronous architecture tailored for space-qualified platforms. Ground experiments show that LLM-guided supervision improves thermal stability and reduces violations, confirming the feasibility of combining semantic reasoning with adaptive control under hardware constraints. On-orbit validation aboard the ISS initially faced challenges because inference latency was misaligned with the rapid thermal cycles of Low Earth Orbit (LEO) satellites. Once synchronized with the orbital period, the system surpassed the baseline, with reduced violations, extended episode durations, and improved CPU utilization. These findings demonstrate the potential for scalable agentic supervision architectures in future autonomous spacecraft.
This paper presents ASTREA, the first agentic system executed on flight-grade hardware (TRL 9) for autonomous spacecraft operations, with in-orbit validation conducted on the International Space Station (ISS). Using thermal control as a representative use case, the system integrates a resource-constrained large language model (LLM) agent with a reinforcement learning controller in an asynchronous architecture customized for space-grade platforms. Ground experiments demonstrate that LLM-guided supervision improves thermal stability and reduces violations, confirming the feasibility of combining semantic reasoning with adaptive control under hardware constraints. In-orbit validation on the ISS initially struggled because inference latency was mismatched to the rapid thermal cycles of low Earth orbit (LEO) satellites. After synchronization with the orbital period, the system surpassed the baseline, reducing violations, extending episode durations, and improving CPU utilization.
Need for Autonomous Space Operations: With the advancement of lunar and Earth orbital missions, there is a need for space systems capable of operating with minimal human intervention, particularly in environments where communication delays hinder direct ground supervision.
Complexity of Thermal Control: Thermal control is a critical subsystem that must maintain the operational integrity of all electronic components while managing limited computational resources in real time. Traditional approaches rely on pre-programmed rules and ground supervision, and lack the flexibility to respond to dynamic thermal loads.
Hardware Resource Constraints: Large language models require substantial hardware resources, conflicting with embedded environments that must maintain radiation tolerance and operate under strict constraints on power consumption, size, and temperature.
Input: Onboard temperature sensor data, CPU utilization status, thermal gradient information
Output: Dynamically adjusted entropy coefficient (α) recommendations to optimize the exploration-exploitation balance of the reinforcement learning agent
Constraints:
Temperature threshold limits (60°C ground, 57°C in-orbit)
Single-core computational resources (core 0 dedicated to agentic system)
Inference latency (ranging from 40 seconds to 8 minutes)
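The input, output, and constraints above can be illustrated with a minimal Python sketch of an asynchronous supervisor. It is not the ASTREA implementation: the names `Telemetry`, `recommend_alpha`, and `AsyncSupervisor`, the α range, the 10 °C margin band, and the 60-second projection horizon are all hypothetical, and a simple heuristic stands in for the LLM agent. The sketch only shows the shape of the interface: telemetry goes in, an entropy coefficient α comes out, and the controller keeps using the last α while a slow recommendation is still pending.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

TEMP_LIMIT_GROUND_C = 60.0  # ground threshold stated in the constraints
TEMP_LIMIT_ORBIT_C = 57.0   # in-orbit threshold stated in the constraints


@dataclass
class Telemetry:
    temperature_c: float             # onboard temperature sensor reading
    cpu_utilization: float           # fraction in [0, 1]; carried but unused here
    thermal_gradient_c_per_s: float  # rate of temperature change


def recommend_alpha(t: Telemetry, limit_c: float,
                    alpha_min: float = 0.01, alpha_max: float = 0.2,
                    band_c: float = 10.0, horizon_s: float = 60.0) -> float:
    """Hypothetical stand-in for the LLM agent (an assumption, not the paper's logic).

    Projects the temperature forward using the gradient, then scales alpha
    linearly with the remaining margin: less exploration near the limit.
    """
    projected = t.temperature_c + max(t.thermal_gradient_c_per_s, 0.0) * horizon_s
    margin = limit_c - projected
    frac = min(max(margin / band_c, 0.0), 1.0)
    return alpha_min + frac * (alpha_max - alpha_min)


class AsyncSupervisor:
    """Applies recommendations only after a simulated inference latency."""

    def __init__(self, limit_c: float, latency_steps: int,
                 initial_alpha: float = 0.2) -> None:
        self.limit_c = limit_c
        self.latency_steps = latency_steps
        self.alpha = initial_alpha
        # (step at which the result is ready, recommended alpha)
        self._pending: Optional[Tuple[int, float]] = None

    def step(self, step_idx: int, telemetry: Telemetry) -> float:
        # Apply a recommendation whose simulated inference has finished.
        if self._pending is not None and step_idx >= self._pending[0]:
            self.alpha = self._pending[1]
            self._pending = None
        # If idle, start a new request from the telemetry available now.
        # It will be stale by the time it is applied.
        if self._pending is None:
            ready_at = step_idx + self.latency_steps
            self._pending = (ready_at, recommend_alpha(telemetry, self.limit_c))
        # The controller never blocks: it reads the most recent alpha.
        return self.alpha


if __name__ == "__main__":
    sup = AsyncSupervisor(limit_c=TEMP_LIMIT_ORBIT_C, latency_steps=3)
    warm = Telemetry(temperature_c=50.0, cpu_utilization=0.9,
                     thermal_gradient_c_per_s=0.1)
    for i in range(5):
        print(i, round(sup.step(i, warm), 4))
```

Because each recommendation is computed from telemetry captured when the request was issued, a long latency means α reflects an earlier thermal state. That is the kind of mismatch with fast thermal cycles that the in-orbit results describe.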
Overall Assessment: This paper is a pioneering contribution to space AI applications. Its experimental design, which pairs ground experiments with on-orbit validation, lays a foundation for future intelligent spacecraft development. Despite technical limitations such as inference latency, its engineering value and academic contribution are substantial and merit further research and development.