Score: 0

Designing Co-operation in Systems of Hierarchical, Multi-objective Schedulers for Stream Processing

Published: December 8, 2025 | arXiv ID: 2512.07792v1

By: Animesh Dangwal , Yufeng Jiang , Charlie Arnold and more

Potential Business Impact:

Lets computers handle huge data faster.

Business Areas:
Scheduling Information Technology, Software

Stream processing is a computing paradigm that supports real-time data processing for a wide variety of applications. At Meta, it's used across the company for various tasks such as deriving product insights, providing and improving user services, and enabling AI at scale for our ever-growing user base. Meta's current stream processing framework supports processing TerraBytes(TBs) of data in mere seconds. This is enabled by our efficient schedulers and multi-layered infrastructure, which allocate workloads across various compute resources, working together in hierarchies across various parts of the infrastructure. But with the ever growing complexity of applications, and user needs, areas of the infrastructure that previously required minimal load balancing, now must be made more robust and proactive to application load. In our work we explore how to build and design such a system that focuses on load balancing over key compute resources and properties of these applications. We also showcase how to integrate new schedulers into the hierarchy of the existing ones, allowing multiple schedulers to work together and perform load balancing, at their infrastructure level, effectively.

Page Count
8 pages

Category
Computer Science:
Distributed, Parallel, and Cluster Computing