Deadlock-free routing for Full-mesh networks without using Virtual Channels
By: Alejandro Cano , Cristóbal Camarero , Carmen Martínez and more
Potential Business Impact:
Makes computer networks faster without extra parts.
High-radix, low-diameter networks like HyperX and Dragonfly use a Full-mesh core, and rely on multiple virtual channels (VCs) to avoid packet deadlocks in adaptive routing. However, VCs introduce significant overhead in the switch in terms of area, power, and design complexity, limiting the switch scalability. This paper starts by revisiting VC-less routing through link ordering schemes in Full-mesh networks, which offer implementation simplicity but suffer from performance degradation under adversarial traffic. Thus, to overcome these challenges, we propose TERA (Topology-Embedded Routing Algorithm), a novel routing algorithm which employs an embedded physical subnetwork to provide deadlock-free non-minimal paths without using VCs. In a Full-mesh network, TERA outperforms link ordering routing algorithms by 80% when dealing with adversarial traffic, and up to 100% in application kernels. Furthermore, compared to other VC-based approaches, it reduces buffer requirements by 50%, while maintaining comparable latency and throughput. Lastly, early results from a 2D-HyperX evaluation show that TERA outperforms state-of-the-art algorithms that use the same number of VCs, achieving performance improvements of up to 32%.
Similar Papers
TeraNoC: A Multi-Channel 32-bit Fine-Grained, Hybrid Mesh-Crossbar NoC for Efficient Scale-up of 1000+ Core Shared-L1-Memory Clusters
Distributed, Parallel, and Cluster Computing
Makes computer chips faster and smaller.
UB-Mesh: a Hierarchically Localized nD-FullMesh Datacenter Network Architecture
Hardware Architecture
Makes AI computers train faster and cheaper.
Toward Self-Healing Networks-on-Chip: RL-Driven Routing in 2D Torus Architectures
Distributed, Parallel, and Cluster Computing
Makes computer chips work better when parts break.