Score: 1

An Open-Source HW-SW Co-Development Framework Enabling Efficient Multi-Accelerator Systems

Published: August 20, 2025 | arXiv ID: 2508.14582v1

By: Ryan Albert Antonio , Joren Dumoulin , Xiaoling Yi and more

Potential Business Impact:

Makes AI faster by connecting computer parts better.

Business Areas:

Application Specific Integrated Circuit (ASIC) Hardware

Heterogeneous accelerator-centric compute clusters are emerging as efficient solutions for diverse AI workloads. However, current integration strategies often compromise data movement efficiency and encounter compatibility issues in hardware and software. This prevents a unified approach that balances performance and ease of use. To this end, we present SNAX, an open-source integrated HW-SW framework enabling efficient multi-accelerator platforms through a novel hybrid-coupling scheme, consisting of loosely coupled asynchronous control and tightly coupled data access. SNAX brings reusable hardware modules designed to enhance compute accelerator utilization, and its customizable MLIR-based compiler to automate key system management tasks, jointly enabling rapid development and deployment of customized multi-accelerator compute clusters. Through extensive experimentation, we demonstrate SNAX's efficiency and flexibility in a low-power heterogeneous SoC. Accelerators can easily be integrated and programmed to achieve > 10x improvement in neural network performance compared to other accelerator systems while maintaining accelerator utilization of > 90% in full system operation.

SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural Networks

Neural and Evolutionary Computing

Makes smart machines use less power.

18 May 2025 0

86%

Scalable and Efficient Intra- and Inter-node Interconnection Networks for Post-Exascale Supercomputers and Data centers

Hardware Architecture

Makes supercomputers faster by fixing data jams.

6 Nov 2025 0

86%

SynapticCore-X: A Modular Neural Processing Architecture for Low-Cost FPGA Acceleration

Hardware Architecture

Builds smart computer chips cheaply for AI.

16 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇧🇪 Belgium

Repos / Data Links

github.com

Page Count

8 pages

An Open-Source HW-SW Co-Development Framework Enabling Efficient Multi-Accelerator Systems

Makes AI faster by connecting computer parts better.

Technical Abstract

SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural Networks

Scalable and Efficient Intra- and Inter-node Interconnection Networks for Post-Exascale Supercomputers and Data centers

SynapticCore-X: A Modular Neural Processing Architecture for Low-Cost FPGA Acceleration