STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
By: Sicheng Shen, Dongcheng Zhao, Linghao Feng, and more
Potential Business Impact:
Benchmarks brain-like AI models for fair comparison.
Spiking Transformers have recently emerged as promising architectures for combining the efficiency of spiking neural networks with the representational power of self-attention. However, the lack of standardized implementations, evaluation pipelines, and consistent design choices has hindered fair comparison and principled analysis. In this paper, we introduce STEP, a unified benchmark framework for Spiking Transformers that supports a wide range of tasks, including classification, segmentation, and detection across static, event-based, and sequential datasets. STEP provides modular support for diverse components such as spiking neurons, input encodings, surrogate gradients, and multiple backends (e.g., SpikingJelly, BrainCog). Using STEP, we reproduce and evaluate several representative models, and conduct systematic ablation studies on attention design, neuron types, encoding schemes, and temporal modeling capabilities. We also propose a unified analytical model for energy estimation, accounting for spike sparsity, bitwidth, and memory access, and show that quantized ANNs may offer comparable or better energy efficiency. Our results suggest that current Spiking Transformers rely heavily on convolutional frontends and lack strong temporal modeling, underscoring the need for spike-native architectural innovations. The full code is available at: https://github.com/Fancyssc/STEP
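To make the abstract's "spiking neurons and surrogate gradients" concrete, below is a minimal, generic PyTorch sketch of a leaky integrate-and-fire (LIF) step trained with a rectangular surrogate gradient, the kind of component a framework like STEP modularizes. The names (SpikeFn, lif_step) and constants here are hypothetical illustrations, not STEP's actual API.

```python
import torch

class SpikeFn(torch.autograd.Function):
    """Heaviside spike in the forward pass, rectangular surrogate
    gradient in the backward pass (one common choice among many)."""

    @staticmethod
    def forward(ctx, v, threshold):
        ctx.save_for_backward(v)
        ctx.threshold = threshold
        return (v >= threshold).float()

    @staticmethod
    def backward(ctx, grad_out):
        (v,) = ctx.saved_tensors
        # Pass gradient only inside a window around the threshold.
        window = ((v - ctx.threshold).abs() < 0.5).float()
        return grad_out * window, None

def lif_step(v, x, tau=2.0, threshold=1.0):
    """One leaky integrate-and-fire update: leak toward the input,
    spike where the membrane crosses threshold, then hard-reset."""
    v = v + (x - v) / tau
    s = SpikeFn.apply(v, threshold)
    v = v * (1.0 - s)  # hard reset where a spike fired
    return v, s

# Toy usage: unroll a random input sequence over T time steps.
T, batch, dim = 4, 2, 8
v = torch.zeros(batch, dim)
for _ in range(T):
    v, s = lif_step(v, torch.rand(batch, dim))
```

Swapping the neuron model, the surrogate window, or the reset rule is exactly the kind of ablation a unified benchmark makes comparable across papers.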
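The energy claim can likewise be made concrete with a back-of-envelope version of such an analytical model. The per-operation energies below are the commonly cited 45 nm CMOS estimates (roughly 4.6 pJ per 32-bit MAC and 0.9 pJ per accumulate, after Horowitz, ISSCC 2014); the layer size, spike rates, time steps, and linear bitwidth scaling are illustrative assumptions, and the paper's full model additionally accounts for memory access, which this sketch omits.

```python
# Commonly cited 45 nm CMOS per-operation energies (Horowitz, ISSCC 2014).
E_MAC_FP32 = 4.6e-12  # J per 32-bit multiply-accumulate
E_AC_FP32 = 0.9e-12   # J per 32-bit accumulate (spike-driven add)

def snn_energy(synaptic_ops, spike_rate, time_steps, e_ac=E_AC_FP32):
    """Spiking layer: only arriving spikes trigger accumulates,
    repeated over every time step."""
    return synaptic_ops * spike_rate * time_steps * e_ac

def ann_energy(macs, bitwidth=32, e_mac=E_MAC_FP32):
    """Quantized ANN layer, assuming MAC energy scales roughly
    linearly with bitwidth (a simplifying assumption)."""
    return macs * e_mac * (bitwidth / 32)

ops = 1e9  # a 1-GOP layer, purely illustrative
print(f"SNN, 15% spike rate, T=4: {snn_energy(ops, 0.15, 4) * 1e3:.2f} mJ")
print(f"SNN, 30% spike rate, T=4: {snn_energy(ops, 0.30, 4) * 1e3:.2f} mJ")
print(f"ANN, fp32:                {ann_energy(ops) * 1e3:.2f} mJ")
print(f"ANN, int8:                {ann_energy(ops, bitwidth=8) * 1e3:.2f} mJ")
```

Under these illustrative numbers, a 30% spike rate over four time steps (1.08 mJ) already approaches an int8 ANN (1.15 mJ), which is the kind of crossover behind the abstract's caution that quantized ANNs may match or beat Spiking Transformers on energy.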
Similar Papers
STEP: Structured Training and Evaluation Platform for benchmarking trajectory prediction models
Machine Learning (CS)
Tests self-driving car predictions better.
Hardware Efficient Accelerator for Spiking Transformer With Reconfigurable Parallel Time Step Computing
Hardware Architecture
Makes AI brains use less power to think.
STF: Shallow-Level Temporal Feedback to Enhance Spiking Transformers
Neural and Evolutionary Computing
Makes computer brains learn better with less energy.