Small Ensemble-based Data Assimilation: A Machine Learning-Enhanced Data Assimilation Method with Limited Ensemble Size
By: Zhilin Li , Yao Zhou , Xianglong Li and more
Potential Business Impact:
Makes computer weather forecasts more accurate, faster.
Ensemble-based data assimilation (DA) methods have become increasingly popular due to their inherent ability to address nonlinear dynamic problems. However, these methods often face a trade-off between analysis accuracy and computational efficiency, as larger ensemble sizes required for higher accuracy also lead to greater computational cost. In this study, we propose a novel machine learning-based data assimilation approach that combines the traditional ensemble Kalman filter (EnKF) with a fully connected neural network (FCNN). Specifically, our method uses a relatively small ensemble size to generate preliminary yet suboptimal analysis states via EnKF. A FCNN is then employed to learn and predict correction terms for these states, thereby mitigating the performance degradation induced by the limited ensemble size. We evaluate the performance of our proposed EnKF-FCNN method through numerical experiments involving Lorenz systems and nonlinear ocean wave field simulations. The results consistently demonstrate that the new method achieves higher accuracy than traditional EnKF with the same ensemble size, while incurring negligible additional computational cost. Moreover, the EnKF-FCNN method is adaptable to diverse applications through coupling with different models and the use of alternative ensemble-based DA methods.
Similar Papers
Flow Matching-Based Generative Modeling for Efficient and Scalable Data Assimilation
Machine Learning (Stat)
Makes computer weather forecasts faster and better.
Flow Matching-Based Generative Modeling for Efficient and Scalable Data Assimilation
Machine Learning (Stat)
Helps computers guess what's happening faster.
Reinforcement learning based data assimilation for unknown state model
Machine Learning (CS)
Learns how things work from messy data.