Score: 0

Hidden Sketch: A Space-Efficient Reversible Sketch for Tracking Frequent Items in Data Streams

Published: May 18, 2025 | arXiv ID: 2505.12293v1

By: Zicang Xu , Yuxuan Tian , Yuhan Wu and more

Potential Business Impact:

Tracks popular items in data streams using less memory.

Business Areas:
Darknet Internet Services

Modern data stream applications demand memory-efficient solutions for accurately tracking frequent items, such as heavy hitters and heavy changers, under strict resource constraints. Traditional sketches face inherent accuracy-memory trade-offs: they either lose precision to reduce memory usage or inflate memory costs to enable high recording capacity. This paper introduces Hidden Sketch, a space-efficient reversible data structure for key and frequency encoding. Our design uniquely combines a Reversible Bloom Filter (RBF) and a Count-Min (CM) Sketch for invertible key and frequency storage, enabling precise reconstruction for both keys and their frequencies with minimal memory. Theoretical analysis establishes Hidden Sketch's space complexity and guaranteed reversibility, while extensive experiments demonstrate its substantial improvements in accuracy and space efficiency in frequent item tracking tasks. By eliminating the trade-off between reversibility and space efficiency, Hidden Sketch provides a scalable foundation for real-time stream analytics in resource-constrained environments.

Country of Origin
🇨🇳 China

Page Count
11 pages

Category
Computer Science:
Databases