GPZ: GPU-Accelerated Lossy Compressor for Particle Data
By: Ruoyu Li , Yafan Huang , Longtao Zhang and more
Potential Business Impact:
Makes huge science data smaller and faster.
Particle-based simulations and point-cloud applications generate massive, irregular datasets that challenge storage, I/O, and real-time analytics. Traditional compression techniques struggle with irregular particle distributions and GPU architectural constraints, often resulting in limited throughput and suboptimal compression ratios. In this paper, we present GPZ, a high-performance, error-bounded lossy compressor designed specifically for large-scale particle data on modern GPUs. GPZ employs a novel four-stage parallel pipeline that synergistically balances high compression efficiency with the architectural demands of massively parallel hardware. We introduce a suite of targeted optimizations for computation, memory access, and GPU occupancy that enables GPZ to achieve near-hardware-limit throughput. We conduct an extensive evaluation on three distinct GPU architectures (workstation, data center, and edge) using six large-scale, real-world scientific datasets from five distinct domains. The results demonstrate that GPZ consistently and significantly outperforms five state-of-the-art GPU compressors, delivering up to 8x higher end-to-end throughput while simultaneously achieving superior compression ratios and data quality.
Similar Papers
A High-Throughput GPU Framework for Adaptive Lossless Compression of Floating-Point Data
Databases
Shrinks big computer data without losing any details.
GPU-Based Floating-point Adaptive Lossless Compression
Databases
Makes computer data smaller, faster, and perfect.
Physics-Aware Compression of Plasma Distribution Functions with GPU-Accelerated Gaussian Mixture Models
Computational Engineering, Finance, and Science
Shrinks big science data, keeping important science details.