An Open-Source HW-SW Co-Development Framework Enabling Efficient Multi-Accelerator Systems
By: Ryan Albert Antonio , Joren Dumoulin , Xiaoling Yi and more
Potential Business Impact:
Makes AI faster by connecting computer parts better.
Heterogeneous accelerator-centric compute clusters are emerging as efficient solutions for diverse AI workloads. However, current integration strategies often compromise data movement efficiency and encounter compatibility issues in hardware and software. This prevents a unified approach that balances performance and ease of use. To this end, we present SNAX, an open-source integrated HW-SW framework enabling efficient multi-accelerator platforms through a novel hybrid-coupling scheme, consisting of loosely coupled asynchronous control and tightly coupled data access. SNAX brings reusable hardware modules designed to enhance compute accelerator utilization, and its customizable MLIR-based compiler to automate key system management tasks, jointly enabling rapid development and deployment of customized multi-accelerator compute clusters. Through extensive experimentation, we demonstrate SNAX's efficiency and flexibility in a low-power heterogeneous SoC. Accelerators can easily be integrated and programmed to achieve > 10x improvement in neural network performance compared to other accelerator systems while maintaining accelerator utilization of > 90% in full system operation.
Similar Papers
SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural Networks
Neural and Evolutionary Computing
Makes smart machines use less power.
Scalable and Efficient Intra- and Inter-node Interconnection Networks for Post-Exascale Supercomputers and Data centers
Hardware Architecture
Makes supercomputers faster by fixing data jams.
SynapticCore-X: A Modular Neural Processing Architecture for Low-Cost FPGA Acceleration
Hardware Architecture
Builds smart computer chips cheaply for AI.