An Efficient Compression of Deep Neural Network Checkpoints Based on Prediction and Context Modeling
By: Yuriy Kim, Evgeny Belyaev
Potential Business Impact:
Shrinks neural network training files (checkpoints) to save storage space.
This paper is dedicated to the efficient compression of weights and optimizer states (called checkpoints) obtained at different stages of a neural network training process. First, we propose a prediction-based compression approach in which values from the previously saved checkpoint are used for context modeling in arithmetic coding. Second, to further improve compression performance, we apply pruning and quantization to the checkpoint values. Experimental results show that our approach achieves a substantial reduction in bit size while enabling near-lossless training recovery from the restored checkpoints, preserving the model's performance and making the method suitable for storage-limited environments.
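The abstract only outlines the approach, so the following is a minimal sketch, not the authors' implementation, of how pruning, quantization, and previous-checkpoint context modeling could fit together. The function and parameter names (prune_and_quantize, contextual_code_length, threshold, step, num_contexts) are assumptions introduced for illustration, and instead of running a real arithmetic coder the sketch estimates the ideal arithmetic-coding cost from per-context symbol frequencies.

```python
# Sketch: quantize/prune the current checkpoint, model each symbol's
# probability conditioned on the co-located value from the previous
# checkpoint, and estimate the arithmetic-coding cost under that model.
import numpy as np

def prune_and_quantize(w, threshold=1e-3, step=1e-2):
    """Zero out small weights, then uniformly quantize to integer symbols."""
    w = np.where(np.abs(w) < threshold, 0.0, w)
    return np.round(w / step).astype(np.int32)

def contextual_code_length(prev_syms, curr_syms, num_contexts=16):
    """Estimate the bits needed to code curr_syms when the probability
    model is conditioned on the previous checkpoint's symbols."""
    # Map previous-checkpoint symbols to a small set of context bins.
    ctx = np.clip(prev_syms + num_contexts // 2, 0, num_contexts - 1)
    total_bits = 0.0
    for c in np.unique(ctx):
        group = curr_syms[ctx == c]
        # Per-context symbol frequencies (Laplace-smoothed).
        vals, counts = np.unique(group, return_counts=True)
        probs = (counts + 1) / (counts.sum() + len(vals))
        # Shannon cost = ideal arithmetic-coding length for this context.
        total_bits += -(counts * np.log2(probs)).sum()
    return total_bits

# Toy usage: two consecutive checkpoints of the same layer, where the
# later one is a small update of the earlier one.
rng = np.random.default_rng(0)
prev_ckpt = rng.normal(scale=0.05, size=10_000)
curr_ckpt = prev_ckpt + rng.normal(scale=0.005, size=10_000)
prev_q = prune_and_quantize(prev_ckpt)
curr_q = prune_and_quantize(curr_ckpt)
bits = contextual_code_length(prev_q, curr_q)
print(f"estimated coded size: {bits / 8 / 1024:.1f} KiB "
      f"vs raw {curr_ckpt.nbytes / 1024:.1f} KiB")
```

Because consecutive checkpoints are highly correlated, conditioning the probability model on the previous checkpoint concentrates the symbol distribution within each context, which is what allows the (ideal) arithmetic coder to spend far fewer bits than coding the checkpoint independently.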
Similar Papers
Coding for Computation: Efficient Compression of Neural Networks for Reconfigurable Hardware
Machine Learning (CS)
Makes smart computer programs run much faster.
Optimizing Deep Neural Networks using Safety-Guided Self Compression
Machine Learning (CS)
Shrinks smart computer programs without losing smarts.
Lossless Compression of Neural Network Components: Weights, Checkpoints, and K/V Caches in Low-Precision Formats
Machine Learning (CS)
Shrinks AI models to save space and run faster.