Score: 0

A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN Accelerator

Published: November 18, 2025 | arXiv ID: 2511.14202v1

By: Weiping Yang , Shilin Zhou , Hui Xu and more

Potential Business Impact:

Makes AI chips faster and use less power.

Business Areas:

A/B Testing Data and Analytics

Compute-in-Memory (CIM) and weight sparsity are two effective techniques to reduce data movement during Neural Network (NN) inference. However, they can hardly be employed in the same accelerator simultaneously because CIM requires structural compute patterns which are disrupted in sparse NNs. In this paper, we partially solve this issue by proposing a bit level weight reordering strategy which can realize compact mapping of sparse NN weight matrices onto Resistive Random Access Memory (RRAM) based NN Accelerators (RRAM-Acc). In specific, when weights are mapped to RRAM crossbars in a binary complement manner, we can observe that, which can also be mathematically proven, bit-level sparsity and similarity commonly exist in the crossbars. The bit reordering method treats bit sparsity as a special case of bit similarity, reserve only one column in a pair of columns that have identical bit values, and then map the compressed weight matrices into Operation Units (OU). The performance of our design is evaluated with typical NNs. Simulation results show a 61.24% average performance improvement and 1.51x-2.52x energy savings under different sparsity ratios, with only slight overhead compared to the state-of-the-art design.

Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory

Hardware Architecture

Makes computer chips use less power for AI.

22 Dec 2025 0

88%

CIMinus: Empowering Sparse DNN Workloads Modeling and Exploration on SRAM-based CIM Architectures

Hardware Architecture

Helps computers learn faster by using less energy.

20 Nov 2025 0

88%

Reconfigurable Digital RRAM Logic Enables In-Situ Pruning and Learning for Edge AI

Hardware Architecture

Makes AI learn faster and use less power.

16 Jun 2025 0

View PDF Login to Bookmark

Page Count

10 pages

A Bit Level Weight Reordering Strategy Based on Column Similarity to Explore Weight Sparsity in RRAM-based NN Accelerator

Makes AI chips faster and use less power.

Technical Abstract

Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory

CIMinus: Empowering Sparse DNN Workloads Modeling and Exploration on SRAM-based CIM Architectures

Reconfigurable Digital RRAM Logic Enables In-Situ Pruning and Learning for Edge AI