From Edge to HPC: Investigating Cross-Facility Data Streaming Architectures
By: Anjus George , Michael Brim , Christopher Zimmer and more
Potential Business Impact:
Moves computer data faster for science.
In this paper, we investigate three cross-facility data streaming architectures, Direct Streaming (DTS), Proxied Streaming (PRS), and Managed Service Streaming (MSS). We examine their architectural variations in data flow paths and deployment feasibility, and detail their implementation using the Data Streaming to HPC (DS2HPC) architectural framework and the SciStream memory-to-memory streaming toolkit on the production-grade Advanced Computing Ecosystem (ACE) infrastructure at Oak Ridge Leadership Computing Facility (OLCF). We present a workflow-specific evaluation of these architectures using three synthetic workloads derived from the streaming characteristics of scientific workflows. Through simulated experiments, we measure streaming throughput, round-trip time, and overhead under work sharing, work sharing with feedback, and broadcast and gather messaging patterns commonly found in AI-HPC communication motifs. Our study shows that DTS offers a minimal-hop path, resulting in higher throughput and lower latency, whereas MSS provides greater deployment feasibility and scalability across multiple users but incurs significant overhead. PRS lies in between, offering a scalable architecture whose performance matches DTS in most cases.
Similar Papers
A Study on Messaging Trade-offs in Data Streaming for Scientific Workflows
Distributed, Parallel, and Cluster Computing
Moves science data instantly, speeding up discoveries.
To Stream or Not to Stream: Towards A Quantitative Model for Remote HPC Processing Decisions
Distributed, Parallel, and Cluster Computing
Lets scientists analyze huge data instantly.
Designing Co-operation in Systems of Hierarchical, Multi-objective Schedulers for Stream Processing
Distributed, Parallel, and Cluster Computing
Lets computers handle huge data faster.