AttentiveGRU: Recurrent Spatio-Temporal Modeling for Advanced Radar-Based BEV Object Detection
By: Loveneet Saini, Mirko Meuter, Hasan Tercan and more
Potential Business Impact:
Helps cars see better with radar.
Bird's-eye view (BEV) object detection has become important for advanced automotive 3D radar-based perception systems. However, the inherently sparse and non-deterministic nature of radar data limits the effectiveness of traditional single-frame BEV paradigms. In this paper, we address this limitation by introducing AttentiveGRU, a novel attention-based recurrent approach tailored to radar constraints. It extracts individualized spatio-temporal context for objects by dynamically identifying and fusing temporally correlated structures across the present and memory states. By leveraging the consistency of an object's latent representation over time, our approach exploits temporal relations to enrich feature representations for both stationary and moving objects, thereby enhancing detection performance and eliminating the need to externally provide or estimate any information about ego-vehicle motion. Our experimental results on the public nuScenes dataset show a significant increase in mAP for the car category, by 21% over the best radar-only submission. Further evaluations on an additional dataset demonstrate notable improvements in object detection capabilities, underscoring the applicability and effectiveness of our method.
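To make the idea of fusing present and memory states concrete, the following is a minimal NumPy sketch of an attention-gated recurrent update on a BEV feature grid. It is an illustration under stated assumptions, not the architecture from the paper: the function names (`local_attention`, `attentive_gru_step`), the local-window dot-product attention, the per-cell linear gates, and all shapes and weights are hypothetical choices made here. Each cell of the current frame attends over a small neighbourhood of the memory state, so that features of an object that has shifted between frames can be matched without any ego-motion input, and the attended memory is then merged through a standard GRU gate.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def local_attention(x, h, radius=1):
    """For each BEV cell of x, attend over the (2r+1)^2 neighbourhood of h.

    x, h: arrays of shape (H, W, C). Returns attended memory of shape (H, W, C).
    Assumption: plain dot-product attention within a fixed local window.
    """
    H, W, C = x.shape
    # Zero-pad the memory so border cells also have a full window.
    padded = np.pad(h, ((radius, radius), (radius, radius), (0, 0)))
    # Stack all shifted views of the memory: shape (K, H, W, C).
    neigh = np.stack([
        padded[dy:dy + H, dx:dx + W]
        for dy in range(2 * radius + 1)
        for dx in range(2 * radius + 1)
    ])
    # Similarity between the present feature and each neighbouring memory cell.
    scores = np.einsum("hwc,khwc->khw", x, neigh) / np.sqrt(C)
    scores = scores - scores.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)
    # Weighted sum of neighbouring memory features.
    return np.einsum("khw,khwc->hwc", weights, neigh)

def attentive_gru_step(x, h, params, radius=1):
    """One recurrent step: align memory by attention, then apply GRU gating."""
    h_att = local_attention(x, h, radius)
    xh = np.concatenate([x, h_att], axis=-1)
    z = sigmoid(xh @ params["Wz"])  # update gate
    r = sigmoid(xh @ params["Wr"])  # reset gate
    cand = np.tanh(np.concatenate([x, r * h_att], axis=-1) @ params["Wh"])
    return (1.0 - z) * h_att + z * cand

# Hypothetical toy setup: an 8x8 BEV grid with 4 feature channels.
rng = np.random.default_rng(0)
H, W, C = 8, 8, 4
params = {k: rng.normal(scale=0.1, size=(2 * C, C)) for k in ("Wz", "Wr", "Wh")}
h = np.zeros((H, W, C))            # empty memory before the first frame
for _ in range(3):                 # three random stand-in radar frames
    x = rng.normal(size=(H, W, C))
    h = attentive_gru_step(x, h, params)
```

In this sketch the attention window bounds how far a feature can move between frames and still be matched; a real system would likely use learned query/key projections and convolutional gates, which are omitted here for brevity.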
Similar Papers
DenseBEV: Transforming BEV Grid Cells into 3D Objects
CV and Pattern Recognition
Helps cars see better in 3D.
Graph Query Networks for Object Detection with Automotive Radar
CV and Pattern Recognition
Helps cars see better with radar.
Spatiotemporal Attention Learning Framework for Event-Driven Object Recognition
CV and Pattern Recognition
Helps cameras see fast-moving things clearly.