Physics-Aware Compression of Plasma Distribution Functions with GPU-Accelerated Gaussian Mixture Models
By: Andong Hu , Luca Pennati , Ivy Peng and more
Potential Business Impact:
Shrinks big science data, keeping important science details.
Data compression is a critical technology for large-scale plasma simulations. Storing complete particle information requires Terabyte-scale data storage, and analysis requires ad-hoc scalable post-processing tools. We propose a physics-aware in-situ compression method using Gaussian Mixture Models (GMMs) to approximate electron and ion velocity distribution functions with a number of Gaussian components. This GMM-based method allows us to capture plasma features such as mean velocity and temperature, and it enables us to identify heating processes and generate beams. We first construct a histogram to reduce computational overhead and apply GPU-accelerated, in-situ GMM fitting within iPIC3D, a large-scale implicit Particle-in-Cell simulator, ensuring real-time compression. The compressed representation is stored using the ADIOS 2 library, thus optimizing the I/O process. The GPU and histogramming implementation provides a significant speed-up with respect to GMM on particles (both in time and required memory at run-time), enabling real-time compression. Compared to algorithms like SZ, MGARD, and BLOSC2, our GMM-based method has a physics-based approach, retaining the physical interpretation of plasma phenomena such as beam formation, acceleration, and heating mechanisms. Our GMM algorithm achieves a compression ratio of up to $10^4$, requiring a processing time comparable to, or even lower than, standard compression engines.
Similar Papers
GPZ: GPU-Accelerated Lossy Compressor for Particle Data
Distributed, Parallel, and Cluster Computing
Makes huge science data smaller and faster.
A Portable Multi-GPU Solver for Collisional Plasmas with Coulombic Interactions
Computational Engineering, Finance, and Science
Makes computer simulations of plasmas run much faster.
3D Gaussian Splatting Data Compression with Mixture of Priors
CV and Pattern Recognition
Makes 3D scenes smaller for easier sharing.