Sketch Disaggregation Across Time and Space
By: Jonatan Langlet , Peiqing Chen , Michael Mitzenmacher and more
Potential Business Impact:
Splits data summaries across many network devices.
Streaming analytics are essential in a large range of applications, including databases, networking, and machine learning. To optimize performance, practitioners are increasingly offloading such analytics to network nodes such as switches. However, resources such as fast SRAM memory available at switches are limited, not uniform, and may serve other functionalities as well (e.g., firewall). Moreover, resource availability can also change over time due to the dynamic demands of in-network applications. In this paper, we propose a new approach to disaggregating data structures over time and space, leveraging any residual resource available at network nodes. We focus on sketches, which are fundamental for summarizing data for streaming analytics while providing beneficial space-accuracy tradeoffs. Our idea is to break sketches into multiple `fragments' that are placed at different network nodes. The fragments cover different time periods and are of varying sizes, and are combined to form a network-wide view of the underlying traffic. We apply our solution to three popular sketches (namely, Count Sketch, Count-Min Sketch, and UnivMon) and demonstrate we can achieve approximately a 75% memory size reduction for the same error for many queries, or a near order-of-magnitude error reduction if memory is kept unchanged.
Similar Papers
Distributed Recoverable Sketches (Extended Version)
Distributed, Parallel, and Cluster Computing
Helps networks remember lost data after crashes.
Memory-efficient Sketch Acceleration for Handling Large Network Flows on FPGAs
Hardware Architecture
Makes internet traffic tracking faster and more accurate.
Sketched Sum-Product Networks for Joins
Databases
Makes computer searches faster by guessing results.