SELECT: A Submodular Approach for Active LiDAR Semantic Segmentation
By: Ruiyu Mao , Sarthak Kumar Maharana , Xulong Tang and more
Potential Business Impact:
Helps self-driving cars learn from less data.
LiDAR-based semantic segmentation plays a vital role in autonomous driving by enabling detailed understanding of 3D environments. However, annotating LiDAR point clouds is extremely costly and requires assigning semantic labels to millions of points with complex geometric structures. Active Learning (AL) has emerged as a promising approach to reduce labeling costs by querying only the most informative samples. Yet, existing AL methods face critical challenges when applied to large-scale 3D data: outdoor scenes contain an overwhelming number of points and suffer from severe class imbalance, where rare classes have far fewer points than dominant classes. To address these issues, we propose SELECT, a voxel-centric submodular approach tailored for active LiDAR semantic segmentation. Our method targets both scalability problems and class imbalance through three coordinated stages. First, we perform Voxel-Level Submodular Subset Selection, which efficiently identifies representative voxels without pairwise comparisons, ensuring scalability. Second, we estimate Voxel-Level Model Uncertainty using Monte Carlo dropout, aggregating point-wise uncertainties to identify informative voxels. Finally, we introduce Submodular Maximization for Point-Level Class Balancing, which selects a subset of points that enhances label diversity, explicitly mitigating class imbalance. Experiments on SemanticPOSS, SemanticKITTI, and nuScenes benchmarks demonstrate that SELECT achieves superior performance compared to prior active learning approaches for 3D semantic segmentation.
Similar Papers
Label-Efficient Point Cloud Segmentation with Active Learning
CV and Pattern Recognition
Teaches computers to learn from less 3D data.
Label-efficient Single Photon Images Classification via Active Learning
Image and Video Processing
Teaches computers to understand tricky pictures faster.
Integrating Semi-Supervised and Active Learning for Semantic Segmentation
CV and Pattern Recognition
Teaches computers to learn from less labeled data.