Spectral Informed Mamba for Robust Point Cloud Processing
By: Ali Bahri , Moslem Yazdanpanah , Mehrdad Noori and more
Potential Business Impact:
Helps computers understand 3D shapes better.
State space models have shown significant promise in Natural Language Processing (NLP) and, more recently, computer vision. This paper introduces a new methodology leveraging Mamba and Masked Autoencoder networks for point cloud data in both supervised and self-supervised learning. We propose three key contributions to enhance Mamba's capability in processing complex point cloud structures. First, we exploit the spectrum of a graph Laplacian to capture patch connectivity, defining an isometry-invariant traversal order that is robust to viewpoints and better captures shape manifolds than traditional 3D grid-based traversals. Second, we adapt segmentation via a recursive patch partitioning strategy informed by Laplacian spectral components, allowing finer integration and segment analysis. Third, we address token placement in Masked Autoencoder for Mamba by restoring tokens to their original positions, which preserves essential order and improves learning. Extensive experiments demonstrate the improvements of our approach in classification, segmentation, and few-shot tasks over state-of-the-art baselines.
Similar Papers
Efficient Spiking Point Mamba for Point Cloud Analysis
CV and Pattern Recognition
Helps computers understand 3D shapes faster, using less power.
SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection
CV and Pattern Recognition
Finds sickness in medical pictures faster.
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection
CV and Pattern Recognition
Helps self-driving cars see in 3D better.