Hidden Sketch: A Space-Efficient Reversible Sketch for Tracking Frequent Items in Data Streams
By: Zicang Xu , Yuxuan Tian , Yuhan Wu and more
Potential Business Impact:
Tracks popular items in data streams using less memory.
Modern data stream applications demand memory-efficient solutions for accurately tracking frequent items, such as heavy hitters and heavy changers, under strict resource constraints. Traditional sketches face inherent accuracy-memory trade-offs: they either lose precision to reduce memory usage or inflate memory costs to enable high recording capacity. This paper introduces Hidden Sketch, a space-efficient reversible data structure for key and frequency encoding. Our design uniquely combines a Reversible Bloom Filter (RBF) and a Count-Min (CM) Sketch for invertible key and frequency storage, enabling precise reconstruction for both keys and their frequencies with minimal memory. Theoretical analysis establishes Hidden Sketch's space complexity and guaranteed reversibility, while extensive experiments demonstrate its substantial improvements in accuracy and space efficiency in frequent item tracking tasks. By eliminating the trade-off between reversibility and space efficiency, Hidden Sketch provides a scalable foundation for real-time stream analytics in resource-constrained environments.
Similar Papers
Enhancing Resiliency of Sketch-based Security via LSB Sharing-based Dynamic Late Merging
Cryptography and Security
Protects internet data from hackers better.
Distributed Recoverable Sketches (Extended Version)
Distributed, Parallel, and Cluster Computing
Helps networks remember lost data after crashes.
Sketch Disaggregation Across Time and Space
Networking and Internet Architecture
Splits data summaries across many network devices.