EDAN: Towards Understanding Memory Parallelism and Latency Sensitivity in HPC
By: Siyuan Shen , Mikhail Khalilov , Lukas Gianinazzi and more
Resource disaggregation is a promising technique for improving the efficiency of large-scale computing systems. However, this comes at the cost of increased memory access latency due to the need to rely on the network fabric to transfer data between remote nodes. As such, it is crucial to ascertain an application's memory latency sensitivity to minimize the overall performance impact. Existing tools for measuring memory latency sensitivity often rely on custom ad-hoc hardware or cycle-accurate simulators, which can be inflexible and time-consuming. To address this, we present EDAN (Execution DAG Analyzer), a novel performance analysis tool that leverages an application's runtime instruction trace to generate its corresponding execution DAG. This approach allows us to estimate the latency sensitivity of sequential programs and investigate the impact of different hardware configurations. EDAN not only provides us with the capability of calculating the theoretical bounds for performance metrics, but it also helps us gain insight into the memory-level parallelism inherent to HPC applications. We apply EDAN to applications and benchmarks such as PolyBench, HPCG, and LULESH to unveil the characteristics of their intrinsic memory-level parallelism and latency sensitivity.
Similar Papers
Dynamic Edge Server Selection in Time-Varying Environments: A Reliability-Aware Predictive Approach
Distributed, Parallel, and Cluster Computing
Keeps apps fast by picking the best server.
EDM: An Ultra-Low Latency Ethernet Fabric for Memory Disaggregation
Operating Systems
Makes computers share memory super fast.
The Future of Memory: Limits and Opportunities
Hardware Architecture
Makes computers faster by putting memory closer.