Score: 1

Integrating Score-Based Diffusion Models with Machine Learning-Enhanced Localization for Advanced Data Assimilation in Geological Carbon Storage

Published: November 7, 2025 | arXiv ID: 2511.05266v1

By: Gabriel Serrão Seabra , Nikolaj T. Mücke , Vinicius Luiz Santos Silva and more

Potential Business Impact:

Helps store carbon safely underground.

Business Areas:
Intelligent Systems Artificial Intelligence, Data and Analytics, Science and Engineering

Accurate characterization of subsurface heterogeneity is important for the safe and effective implementation of geological carbon storage (GCS) projects. This paper explores how machine learning methods can enhance data assimilation for GCS with a framework that integrates score-based diffusion models with machine learning-enhanced localization in channelized reservoirs during CO$_2$ injection. We employ a machine learning-enhanced localization framework that uses large ensembles ($N_s = 5000$) with permeabilities generated by the diffusion model and states computed by simple ML algorithms to improve covariance estimation for the Ensemble Smoother with Multiple Data Assimilation (ESMDA). We apply ML algorithms to a prior ensemble of channelized permeability fields, generated with the geostatistical model FLUVSIM. Our approach is applied on a CO$_2$ injection scenario simulated using the Delft Advanced Research Terra Simulator (DARTS). Our ML-based localization maintains significantly more ensemble variance than when localization is not applied, while achieving comparable data-matching quality. This framework has practical implications for GCS projects, helping improve the reliability of uncertainty quantification for risk assessment.

Country of Origin
🇬🇧 🇳🇱 United Kingdom, Netherlands

Page Count
52 pages

Category
Computer Science:
Machine Learning (CS)