REVNET: Rotation-Equivariant Point Cloud Completion via Vector Neuron Anchor Transformer
By: Zhifan Ni, Eckehard Steinbach
Incomplete point clouds captured by 3D sensors often result in the loss of both geometric and semantic information. Most existing point cloud completion methods are built on rotation-variant frameworks trained with data in canonical poses, limiting their applicability in real-world scenarios. While data augmentation with random rotations can partially mitigate this issue, it significantly increases the learning burden and still fails to guarantee robust performance under arbitrary poses. To address this challenge, we propose the Rotation-Equivariant Anchor Transformer (REVNET), a novel framework built upon the Vector Neuron (VN) network for robust point cloud completion under arbitrary rotations. To preserve local details, we represent partial point clouds as sets of equivariant anchors and design a VN Missing Anchor Transformer to predict the positions and features of missing anchors. Furthermore, we extend VN networks with a rotation-equivariant bias formulation and a ZCA-based layer normalization to improve feature expressiveness. Leveraging the flexible conversion between equivariant and invariant VN features, our model can generate point coordinates with greater stability. Experimental results show that our method outperforms state-of-the-art approaches on the synthetic MVP dataset in the equivariant setting. On the real-world KITTI dataset, REVNET delivers competitive results compared to non-equivariant networks, without requiring input pose alignment. The source code will be released on GitHub under URL: https://github.com/nizhf/REVNET.
Similar Papers
Transformers with Joint Tokens and Local-Global Attention for Efficient Human Pose Estimation
CV and Pattern Recognition
Helps computers see people's body poses better.
Surfel-based 3D Registration with Equivariant SE(3) Features
CV and Pattern Recognition
Aligns 3D scans better, even when tilted.
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
CV and Pattern Recognition
Makes AI see better, using less computer power.