Detector-Augmented SAMURAI for Long-Duration Drone Tracking
By: Tamara R. Lenhard, Andreas Weinmann, Hichem Snoussi and more
Potential Business Impact:
Keeps track of drones better, even when they disappear.
Robust long-term tracking of drones is a critical requirement for modern surveillance systems, given their increasing threat potential. While detector-based approaches typically achieve strong frame-level accuracy, they often suffer from temporal inconsistencies caused by frequent detection dropouts. Despite its practical relevance, research on RGB-based drone tracking is still limited and largely reliant on conventional motion models. Meanwhile, foundation models like SAMURAI have established their effectiveness across other domains, exhibiting strong category-agnostic tracking performance. However, their applicability in drone-specific scenarios has not yet been investigated. Motivated by this gap, we present the first systematic evaluation of SAMURAI's potential for robust drone tracking in urban surveillance settings. Furthermore, we introduce a detector-augmented extension of SAMURAI to mitigate sensitivity to bounding-box initialization and sequence length. Our findings demonstrate that the proposed extension significantly improves robustness in complex urban environments, with pronounced benefits in long-duration sequences, especially under drone exit and re-entry events. The incorporation of detector cues yields consistent gains over SAMURAI's zero-shot performance across datasets and metrics, with success rate improvements of up to +0.393 and FNR reductions of up to -0.475.
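The abstract reports gains in success rate and false negative rate (FNR) without defining either metric. The sketch below shows one common way such per-sequence metrics can be computed from bounding boxes. It is an illustration under stated assumptions, not the paper's evaluation protocol: the IoU threshold of 0.5, the treatment of a missing prediction as a false negative, and the names `iou` and `success_rate_and_fnr` are all hypothetical choices made here.

```python
from typing import Optional, Sequence, Tuple

# Box format (an assumption): (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def success_rate_and_fnr(
    gt: Sequence[Optional[Box]],
    pred: Sequence[Optional[Box]],
    iou_threshold: float = 0.5,  # assumed threshold, not taken from the paper
) -> Tuple[float, float]:
    """Return (success rate, FNR) over frames where the drone is visible.

    gt[i] is None when the drone is out of view (e.g. after an exit);
    pred[i] is None when the tracker reports no target in that frame.
    """
    visible = hits = misses = 0
    for g, p in zip(gt, pred):
        if g is None:
            continue  # drone absent: frame counts toward neither metric
        visible += 1
        if p is None:
            misses += 1  # target present but nothing reported
        elif iou(g, p) >= iou_threshold:
            hits += 1
    if visible == 0:
        return 0.0, 0.0
    return hits / visible, misses / visible


# Toy sequence: the drone leaves the frame at index 2 and then re-enters.
gt = [(0, 0, 10, 10), (0, 0, 10, 10), None, (0, 0, 10, 10), (0, 0, 10, 10)]
pred = [(0, 0, 10, 10), None, None, (5, 5, 15, 15), (0, 0, 10, 8)]
success, fnr = success_rate_and_fnr(gt, pred)
print(success, fnr)
```

Under these assumptions, a reported change of +0.393 in success rate or -0.475 in FNR would be a difference between two such fractions computed for the baseline and the detector-augmented tracker.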
Similar Papers
Benchmarking SAM2-based Trackers on FMOX
CV and Pattern Recognition
Tracks fast-moving things better in videos.
SDG-Track: A Heterogeneous Observer-Follower Framework for High-Resolution UAV Tracking on Embedded Platforms
CV and Pattern Recognition
Tracks tiny flying robots smoothly, even when hidden.
A Multi-Drone Multi-View Dataset and Deep Learning Framework for Pedestrian Detection and Tracking
CV and Pattern Recognition
Tracks people from many moving cameras.