FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
By: Yuhang Zhao, Zixing Wang
Potential Business Impact:
Makes cameras see cars faster, even when crowded.
End-to-end object detectors offer a promising NMS-free paradigm for real-time applications, yet their high computational cost remains a significant barrier, particularly for complex scenarios like intersection traffic monitoring. To address this challenge, we propose FlowDet, a high-speed detector featuring a decoupled encoder optimization strategy applied to the DETR architecture. Specifically, FlowDet employs a novel Geometric Deformable Unit (GDU) for traffic-aware geometric modeling and a Scale-Aware Attention (SAA) module to maintain high representational power across extreme scale variations. To rigorously evaluate the model's performance in environments with severe occlusion and high object density, we collected the Intersection-Flow-5k dataset, a new challenging scene for this task. Evaluated on Intersection-Flow-5k, FlowDet establishes a new state-of-the-art. Compared to the strong RT-DETR baseline, it improves AP(test) by 1.5% and AP50(test) by 1.6%, while simultaneously reducing GFLOPs by 63.2% and increasing inference speed by 16.2%. Our work demonstrates a new path towards building highly efficient and accurate detectors for demanding, real-world perception systems. The Intersection-Flow-5k dataset is available at https://github.com/AstronZh/Intersection-Flow-5K.
Similar Papers
FlowDet: Unifying Object Detection and Generative Transport Flows
CV and Pattern Recognition
Finds objects in pictures much faster.
DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cross-Scene Small Object Detection
CV and Pattern Recognition
Finds tiny flaws in pictures from drones.
RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
CV and Pattern Recognition
Makes cameras see faster and better.