Ensemble data assimilation to diagnose AI-based weather prediction model: A case with ClimaX version 0.3.1
Kotsuki, Shiraishi, Okazaki
Artificial intelligence (AI)-based weather prediction research is growing rapidly and has shown to be competitive with the advanced dynamic numerical weather prediction models. However, research combining AI-based weather prediction models with data assimilation remains limited partially because long-term sequential data assimilation cycles are required to evaluate data assimilation systems. This study proposes using ensemble data assimilation for diagnosing AI-based weather prediction models, and marked the first successful implementation of ensemble Kalman filter with AI-based weather prediction models. Our experiments with an AI-based model ClimaX demonstrated that the ensemble data assimilation cycled stably for the AI-based weather prediction model using covariance inflation and localization techniques within the ensemble Kalman filter. While ClimaX showed some limitations in capturing flow-dependent error covariance compared to dynamical models, the AI-based ensemble forecasts provided reasonable and beneficial error covariance in sparsely observed regions. In addition, ensemble data assimilation revealed that error growth based on ensemble ClimaX predictions was weaker than that of dynamical NWP models, leading to higher inflation factors. A series of experiments demonstrated that ensemble data assimilation can be used to diagnose properties of AI weather prediction models such as physical consistency and accurate error growth representation.
academic
Ensemble data assimilation to diagnose AI-based weather prediction model: A case with ClimaX version 0.3.1
Artificial intelligence (AI) weather forecasting research has developed rapidly and demonstrated competitiveness with advanced dynamical numerical weather prediction (NWP) models. However, research combining AI weather prediction models with data assimilation remains limited, partly because evaluating data assimilation systems requires long sequential data assimilation cycles. This study proposes using ensemble data assimilation to diagnose AI weather prediction models and successfully implements the integration of ensemble Kalman filtering with AI weather prediction models for the first time. Experiments based on the AI model ClimaX demonstrate that ensemble data assimilation can operate stably through sequential cycles by employing covariance inflation and localization techniques within the ensemble Kalman filter. Although ClimaX exhibits limitations compared to dynamical models in capturing flow-dependent error covariance, AI ensemble forecasts provide reasonable and beneficial error covariance in sparsely observed regions. Furthermore, ensemble data assimilation reveals that error growth from ClimaX ensemble forecasts is weaker than that from dynamical NWP models, resulting in higher inflation factors. A series of experiments demonstrate that ensemble data assimilation can be used to diagnose properties of AI weather prediction models such as physical consistency and accurate error growth representation.
Intensifying extreme weather threats: Extreme weather events caused by climate change are becoming increasingly severe, with the World Economic Forum listing extreme weather as one of the most serious global threats
Rapid development of AI weather forecasting: Since Google DeepMind released GraphCast in December 2022, deep learning weather forecasting research has grown rapidly, including Huawei's Pangu-Weather, Microsoft's ClimaX and Stormer, and NVIDIA's FourCastNet
Lagging data assimilation research: Although AI weather prediction models can now compete with state-of-the-art NWP models, research combining AI models with data assimilation remains limited
Technical challenges: The requirement for long sequential data assimilation experiments makes it difficult to evaluate data assimilation systems for AI models
Methodological gaps: While research on variational data assimilation combined with AI models exists, there are no successful cases of ensemble Kalman filtering integrated with AI models
Diagnostic needs: Effective methods are needed to diagnose properties of AI weather prediction models, such as physical consistency and error growth representation
First successful implementation: First successful integration of the Local Ensemble Transform Kalman Filter (LETKF) with an AI weather prediction model (ClimaX)
Stable cyclic operation: Demonstrates that ensemble data assimilation for AI models can operate stably for one year through covariance inflation and localization techniques
Diagnostic framework establishment: Establishes a framework for diagnosing AI weather prediction model characteristics using ensemble data assimilation
Important findings: Reveals limitations of AI models compared to dynamical models in error growth and physical consistency
Technical improvements: Extended ClimaX to support forecasting of more variables to meet data assimilation requirements
The core task of this research is to apply ensemble data assimilation techniques to AI weather prediction models to diagnose their characteristics and evaluate their performance in data assimilation systems. The input consists of atmospheric observations and AI model forecasts, while the output is the assimilated analysis field.
Key components: Variable tokenization and variable aggregation
Extended improvements: Expanded from the default 5 forecast variables to the complete variable set shown in Table 1, supporting data assimilation requirements
System integration: First successful integration of LETKF with AI weather prediction models, developed based on the SPEEDY-LETKF system
Model extension: Extended ClimaX to support the complete variable set required for data assimilation
Diagnostic methods: Utilized optimal localization scales, inflation factors, and other metrics to diagnose AI model characteristics
Observation network design: Adopted an observation network similar to radiosonde observations, with 7-level observations of temperature, wind fields, etc. at observation stations
Short-term forecast limitations: ClimaX cannot perform long-term free integration, gradually deviating from the real atmosphere after 6-hour forecasts
Non-physical field generation: Long-term forecasts produce meteorologically unrealistic weather fields (e.g., extremely low temperatures over the Pacific)
Attractor problem: AI models cannot return to meteorologically reasonable attractor trajectories
Smaller optimal localization scale: 600 km is significantly smaller than the 900 km for dynamical models, indicating insufficient flow-dependent error covariance capture capability
Cannot perform OSSE: Observing System Simulation Experiments cannot be performed due to unstable long-term forecasts
Missing physical constraints: AI models lack constraints from physical laws, easily producing unrealistic weather fields
Lam, R., et al. (2023): Learning skillful medium-range global weather forecasting. Science, 382(6677), 1416-1421.
Bi, K., et al. (2023): Accurate medium-range global weather forecasting with 3D neural networks. Nature, 619(7970), 533-538.
Hunt, B. R., et al. (2007): Efficient data assimilation for spatiotemporal chaos: A local ensemble transform Kalman filter. Physica D, 230(1-2), 112-126.
Nguyen, T., et al. (2023): ClimaX: A foundation model for weather and climate. arXiv preprint arXiv:2301.10343.
This paper has pioneering significance in combining AI weather forecasting with data assimilation. Although it has some technical limitations, it establishes an important foundation for the development of this field and possesses considerable academic value and practical potential.