RTeAAL Sim: Using Tensor Algebra to Represent and Accelerate RTL Simulation (Extended Version)
By: Yan Zhu , Boru Chen , Christopher W. Fletcher and more
Potential Business Impact:
Makes computer chips design much faster.
RTL simulation on CPUs remains a persistent bottleneck in hardware design. State-of-the-art simulators embed the circuit directly into the simulation binary, resulting in long compilation times and execution that is fundamentally CPU frontend-bound, with severe instruction-cache pressure. This work proposes RTeAAL Sim, which reformulates RTL simulation as a sparse tensor algebra problem. By representing RTL circuits as tensors and simulation as a sparse tensor algebra kernel, RTeAAL Sim decouples simulation behavior from binary size and makes RTL simulation amenable to well-studied tensor algebra optimizations. We demonstrate that a prototype of our tensor-based simulator, even with a subset of these optimizations, already mitigates the compilation overhead and frontend pressure and achieves performance competitive with the highly optimized Verilator simulator across multiple CPUs and ISAs.
Similar Papers
T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization
Hardware Architecture
Makes smart AI run faster on small devices.
GSIM: Accelerating RTL Simulation for Large-Scale Designs
Hardware Architecture
Makes computer chip design run much faster.
LEGO: Spatial Accelerator Generation and Optimization for Tensor Applications
Hardware Architecture
Builds AI faster and uses less power.