Spectral Informed Mamba for Robust Point Cloud Processing

Published: March 6, 2025 | arXiv ID: 2503.04953v2

By: Ali Bahri, Moslem Yazdanpanah, Mehrdad Noori and more

Potential Business Impact:

Helps computers understand 3D shapes better.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

State space models have shown significant promise in Natural Language Processing (NLP) and, more recently, computer vision. This paper introduces a new methodology leveraging Mamba and Masked Autoencoder networks for point cloud data in both supervised and self-supervised learning. We propose three key contributions to enhance Mamba's capability in processing complex point cloud structures. First, we exploit the spectrum of a graph Laplacian to capture patch connectivity, defining an isometry-invariant traversal order that is robust to viewpoints and better captures shape manifolds than traditional 3D grid-based traversals. Second, we adapt segmentation via a recursive patch partitioning strategy informed by Laplacian spectral components, allowing finer integration and segment analysis. Third, we address token placement in Masked Autoencoder for Mamba by restoring tokens to their original positions, which preserves essential order and improves learning. Extensive experiments demonstrate the improvements of our approach in classification, segmentation, and few-shot tasks over state-of-the-art baselines.
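
As a rough illustration of the first and third ideas in the abstract, the sketch below orders patch centers by the Fiedler vector (the eigenvector for the second-smallest eigenvalue) of a graph Laplacian, and scatters encoded tokens back to their original sequence positions. It is not the authors' implementation. The fully connected Gaussian-weighted graph, the bandwidth heuristic, and the function names `spectral_order` and `restore_token_positions` are all assumptions made for this example. Because the graph weights depend only on pairwise distances, the resulting order is unchanged by rotations and translations, up to a possible reversal caused by the eigenvector's arbitrary sign.

```python
import numpy as np


def spectral_order(centers, sigma=None):
    """Return a traversal order of patch centers from the Fiedler vector.

    Assumes at least two distinct points and a fully connected graph with
    Gaussian weights (an illustrative choice, not taken from the paper).
    """
    centers = np.asarray(centers, dtype=float)
    # Pairwise Euclidean distances; these are invariant to rigid motions.
    diff = centers[:, None, :] - centers[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    if sigma is None:
        sigma = dist[dist > 0].mean()  # assumed bandwidth heuristic
    weights = np.exp(-((dist / sigma) ** 2))
    np.fill_diagonal(weights, 0.0)
    # Unnormalized graph Laplacian L = D - W.
    laplacian = np.diag(weights.sum(axis=1)) - weights
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    _, eigvecs = np.linalg.eigh(laplacian)
    fiedler = eigvecs[:, 1]
    # The sign of an eigenvector is arbitrary, so the order may come out reversed.
    return np.argsort(fiedler)


def restore_token_positions(visible_tokens, visible_idx, num_tokens, mask_token):
    """Place encoded visible tokens back at their original sequence positions.

    Masked positions are filled with `mask_token` instead of being appended
    at the end, so the sequence keeps its traversal order.
    """
    full = np.tile(np.asarray(mask_token, dtype=float), (num_tokens, 1))
    full[np.asarray(visible_idx)] = visible_tokens
    return full


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    centers = rng.normal(size=(16, 3))       # hypothetical patch centers
    order = spectral_order(centers)
    print("traversal order:", order)

    visible_idx = order[::2]                  # pretend every other patch is visible
    visible_tokens = rng.normal(size=(len(visible_idx), 8))
    sequence = restore_token_positions(visible_tokens, visible_idx, 16, np.zeros(8))
    print("restored sequence shape:", sequence.shape)
```

A sequence model such as Mamba is sensitive to input order, which is why both steps matter: the first fixes an order tied to the shape rather than the viewpoint, and the second keeps that order intact when masked tokens are reinserted.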

Country of Origin
🇨🇦 Canada

Page Count
16 pages

Category
Computer Science:
CV and Pattern Recognition