Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection

Published: August 18, 2025 | arXiv ID: 2508.12684v1

By: Zhongyao Li, Peirui Cheng, Liangjin Zhao and more

Potential Business Impact:

Drones see better together, even with less power.

Multi-UAV collaborative 3D detection enables accurate and robust perception by fusing multi-view observations from aerial platforms. It offers significant advantages in coverage and occlusion handling, but poses new challenges for computation on resource-constrained UAV platforms. In this paper, we present AdaBEV, a novel framework that learns adaptive instance-aware BEV representations through a refine-and-contrast paradigm. Unlike existing methods that treat all BEV grids equally, AdaBEV introduces a Box-Guided Refinement Module (BG-RM) and an Instance-Background Contrastive Learning (IBCL) scheme to enhance semantic awareness and feature discriminability. BG-RM refines only BEV grids associated with foreground instances using 2D supervision and spatial subdivision, while IBCL promotes stronger separation between foreground and background features via contrastive learning in BEV space. Extensive experiments on the Air-Co-Pred dataset demonstrate that AdaBEV achieves superior accuracy-computation trade-offs across model scales, outperforming other state-of-the-art methods at low resolutions and approaching upper-bound performance while maintaining low-resolution BEV inputs and negligible overhead.
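
The two ideas named in the abstract, refining only foreground BEV grids by spatial subdivision and contrasting foreground against background features, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the prototype-based InfoNCE-style loss, the temperature `tau`, and the subdivision factor `s` are all assumptions chosen for illustration, and the abstract does not specify how BG-RM or IBCL are actually computed.

```python
import numpy as np


def l2_normalize(x, eps=1e-8):
    """Scale vectors along the last axis to unit length."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def subdivide_foreground(fg_mask, s=2):
    """Hypothetical stand-in for 'spatial subdivision'.

    Returns sub-cell centre coordinates only for foreground grids,
    so background grids incur no extra sampling cost.
    fg_mask: (H, W) bool. Output: (N_fg, s*s, 2) in grid units.
    """
    cells = np.argwhere(fg_mask)                  # (N_fg, 2) row/col indices
    offsets = (np.arange(s) + 0.5) / s            # centres inside one cell
    oy, ox = np.meshgrid(offsets, offsets, indexing="ij")
    sub = np.stack([oy.ravel(), ox.ravel()], axis=-1)   # (s*s, 2)
    return cells[:, None, :] + sub[None, :, :]


def instance_background_contrast(bev, fg_mask, tau=0.1):
    """Assumed InfoNCE-style loss, not the paper's exact IBCL formulation.

    Each foreground cell is pulled toward the mean foreground feature
    (the positive) and pushed away from every background cell (negatives).
    bev: (H, W, C) float features, fg_mask: (H, W) bool.
    """
    feats = l2_normalize(bev.reshape(-1, bev.shape[-1]))
    mask = fg_mask.reshape(-1).astype(bool)
    fg, bg = feats[mask], feats[~mask]
    if len(fg) == 0 or len(bg) == 0:
        return 0.0                                # nothing to contrast
    proto = l2_normalize(fg.mean(axis=0))
    pos = fg @ proto / tau                        # (N_fg,)
    neg = fg @ bg.T / tau                         # (N_fg, N_bg)
    logits = np.concatenate([pos[:, None], neg], axis=1)
    # Numerically stable log-sum-exp over positive + negatives.
    m = logits.max(axis=1, keepdims=True)
    log_denom = m[:, 0] + np.log(np.exp(logits - m).sum(axis=1))
    return float(np.mean(log_denom - pos))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    bev = rng.normal(size=(8, 8, 16))
    mask = np.zeros((8, 8), dtype=bool)
    mask[2:4, 3:5] = True                         # one toy "instance"
    print(subdivide_foreground(mask, s=2).shape)
    print(instance_background_contrast(bev, mask))
```

In this toy setup the loss shrinks as foreground features cluster together and move away from background features, which is the separation the abstract attributes to IBCL; the subdivision helper only touches the masked cells, mirroring the claim that refinement is restricted to foreground grids.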

BEVCon: Advancing Bird's Eye View Perception with Contrastive Learning

CV and Pattern Recognition

Helps self-driving cars see better from above.

6 Aug 2025 2

88%

BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection

CV and Pattern Recognition

Helps self-driving cars see better in different weather.

17 Sep 2025 1

88%

Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking

CV and Pattern Recognition

Helps self-driving cars see better in 3D.

11 Oct 2025 1

Page Count

9 pages
