Is Discretization Fusion All You Need for Collaborative Perception?
By: Kang Yang , Tianci Bu , Lantao Li and more
Potential Business Impact:
Helps self-driving cars see farther and better.
Collaborative perception in multi-agent system enhances overall perceptual capabilities by facilitating the exchange of complementary information among agents. Current mainstream collaborative perception methods rely on discretized feature maps to conduct fusion, which however, lacks flexibility in extracting and transmitting the informative features and can hardly focus on the informative features during fusion. To address these problems, this paper proposes a novel Anchor-Centric paradigm for Collaborative Object detection (ACCO). It avoids grid precision issues and allows more flexible and efficient anchor-centric communication and fusion. ACCO is composed by three main components: (1) Anchor featuring block (AFB) that targets to generate anchor proposals and projects prepared anchor queries to image features. (2) Anchor confidence generator (ACG) is designed to minimize communication by selecting only the features in the confident anchors to transmit. (3) A local-global fusion module, in which local fusion is anchor alignment-based fusion (LAAF) and global fusion is conducted by spatial-aware cross-attention (SACA). LAAF and SACA run in multi-layers, so agents conduct anchor-centric fusion iteratively to adjust the anchor proposals. Comprehensive experiments are conducted to evaluate ACCO on OPV2V and Dair-V2X datasets, which demonstrate ACCO's superiority in reducing the communication volume, and in improving the perception range and detection performances. Code can be found at: \href{https://github.com/sidiangongyuan/ACCO}{https://github.com/sidiangongyuan/ACCO}.
Similar Papers
Fast2comm:Collaborative perception combined with prior knowledge
CV and Pattern Recognition
Helps self-driving cars see better with less data.
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
CV and Pattern Recognition
Lets cars see around corners together.
Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection
CV and Pattern Recognition
Cars share what they see to drive safer.