How many users have been here for a long time? Efficient solutions for counting long aggregated visits
By: Peyman Afshani , Rezaul Chowdhury , Inge Li Gørtz and more
This paper addresses the Counting Long Aggregated Visits problem, which is defined as follows. We are given $n$ users and $m$ regions, where each user spends some time visiting some regions. For a parameter $k$ and a query consisting of a subset of $r$ regions, the task is to count the number of distinct users whose aggregate time spent visiting the query regions is at least $k$. This problem is motivated by queries arising in the analysis of large-scale mobility datasets. We present several exact and approximate data structures for supporting counting long aggregated visits, as well as conditional and unconditional lower bounds. First, we describe an exact data structure that exhibits a space-time tradeoff, as well as efficient approximate solutions based on sampling and sketching techniques. We then study the problem in geometric settings where regions are points in $\mathbb{R}^d$ and queries are hyperrectangles, and derive exact data structures that achieve improved performance in these structured spaces.
Similar Papers
Approximate Counting in Local Lemma Regimes
Data Structures and Algorithms
Counts hard-to-count things much faster.
Weighted $k$-Path and Other Problems in Almost $O^*(2^k)$ Deterministic Time via Dynamic Representative Sets
Data Structures and Algorithms
Finds shortest paths with many stops faster.
Estimating Hitting Times Locally At Scale
Data Structures and Algorithms
Finds how long things take to connect online.