Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReID
By: Yu-Hsi Chen
Potential Business Impact:
Tracks many drones in heat cameras better.
Detecting and tracking multiple unmanned aerial vehicles (UAVs) in thermal infrared video is inherently challenging due to low contrast, environmental noise, and small target sizes. This paper provides a straightforward approach to address multi-UAV tracking in thermal infrared video, leveraging recent advances in detection and tracking. Instead of relying on the well-established YOLOv5 with DeepSORT combination, we present a tracking framework built on YOLOv12 and BoT-SORT, enhanced with tailored training and inference strategies. We evaluate our approach following the 4th Anti-UAV Challenge metrics and reach competitive performance. Notably, we achieved strong results without using contrast enhancement or temporal information fusion to enrich UAV features, highlighting our approach as a "Strong Baseline" for multi-UAV tracking tasks. We provide implementation details, in-depth experimental analysis, and a discussion of potential improvements. The code is available at https://github.com/wish44165/YOLOv12-BoT-SORT-ReID .
Similar Papers
Real-Time Multi-Object Tracking using YOLOv8 and SORT on a SoC FPGA
CV and Pattern Recognition
Helps robots see and follow many things.
Enhanced Small Target Detection via Multi-Modal Fusion and Attention Mechanisms: A YOLOv5 Approach
CV and Pattern Recognition
Spots tiny, hard-to-see things in pictures.
YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion Fusion
CV and Pattern Recognition
Finds tiny drones in busy skies.