Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection
By: Yechi Ma, Wei Hua, Shu Kong
Potential Business Impact:
Teaches computers to label data automatically.
A crucial yet under-appreciated prerequisite in machine learning solutions for real-world applications is data annotation: human annotators are hired to manually label data according to detailed, expert-crafted guidelines. This is often a laborious, tedious, and costly process. To study methods for facilitating data annotation, we introduce a new benchmark AnnoGuide: Auto-Annotation from Annotation Guidelines. It aims to evaluate automated methods for data annotation directly from expert-defined annotation guidelines, eliminating the need for manual labeling. As a case study, we repurpose the well-established nuScenes dataset, commonly used in autonomous driving research, which provides comprehensive annotation guidelines for labeling LiDAR point clouds with 3D cuboids across 18 object classes. These guidelines include a few visual examples and textual descriptions, but no labeled 3D cuboids in LiDAR data, making this a novel task of multi-modal few-shot 3D detection without 3D annotations. Advances in powerful foundation models (FMs) make AnnoGuide especially timely, as FMs offer promising tools to tackle its challenges. We employ a conceptually straightforward pipeline that (1) utilizes open-source FMs for object detection and segmentation in RGB images, (2) projects 2D detections into 3D using known camera poses, and (3) clusters LiDAR points within the frustum of each 2D detection to generate a 3D cuboid. Starting with a non-learned solution that leverages off-the-shelf FMs, we progressively refine key components and achieve significant performance improvements, boosting 3D detection mAP from 12.1 to 21.9! Nevertheless, our results highlight that AnnoGuide remains an open and challenging problem, underscoring the urgent need for developing LiDAR-based FMs. We release our code and models at GitHub: https://annoguide.github.io/annoguide3Dbenchmark
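To make the pipeline concrete, below is a minimal sketch of steps (2) and (3): lifting a 2D detection into 3D by selecting the LiDAR points that project inside the detection box (its frustum), clustering them, and fitting a cuboid to the largest cluster. This is not the authors' implementation; it assumes numpy and scikit-learn, uses illustrative function names, and simplifies the output to an axis-aligned box with no yaw estimation.

```python
# Minimal sketch (not the paper's code) of frustum-based 2D-to-3D lifting.
# Assumes numpy and scikit-learn; all names and defaults are illustrative.
import numpy as np
from sklearn.cluster import DBSCAN


def points_in_frustum(lidar_xyz, box_2d, cam_intrinsic, lidar_to_cam):
    """Select LiDAR points whose camera projection falls inside a 2D box.

    lidar_xyz:     (N, 3) points in the LiDAR frame
    box_2d:        (x1, y1, x2, y2) detection box in pixel coordinates
    cam_intrinsic: (3, 3) camera matrix K
    lidar_to_cam:  (4, 4) homogeneous LiDAR-to-camera transform
    """
    # Transform points into the camera frame.
    pts_h = np.concatenate([lidar_xyz, np.ones((len(lidar_xyz), 1))], axis=1)
    pts_cam = (lidar_to_cam @ pts_h.T).T[:, :3]
    in_front = pts_cam[:, 2] > 0.1  # keep only points in front of the camera

    # Project to pixel coordinates and keep points landing inside the 2D box.
    uv = (cam_intrinsic @ pts_cam.T).T
    uv = uv[:, :2] / uv[:, 2:3]
    x1, y1, x2, y2 = box_2d
    in_box = (uv[:, 0] >= x1) & (uv[:, 0] <= x2) & \
             (uv[:, 1] >= y1) & (uv[:, 1] <= y2)
    return lidar_xyz[in_front & in_box]


def frustum_points_to_cuboid(frustum_xyz, eps=0.5, min_samples=5):
    """Cluster frustum points and fit an axis-aligned cuboid to the largest cluster."""
    if len(frustum_xyz) < min_samples:
        return None
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(frustum_xyz)
    valid = labels[labels >= 0]
    if valid.size == 0:
        return None
    keep = labels == np.bincount(valid).argmax()  # largest cluster = the object
    cluster = frustum_xyz[keep]
    center = (cluster.min(axis=0) + cluster.max(axis=0)) / 2
    size = cluster.max(axis=0) - cluster.min(axis=0)
    return np.concatenate([center, size])  # (cx, cy, cz, dx, dy, dz), yaw omitted
```

In this sketch the background points that also fall inside the frustum are discarded by keeping only the largest DBSCAN cluster; the paper's refinements of such key components are what drive the reported mAP gains.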
Similar Papers
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding
CV and Pattern Recognition
Teaches computers to understand 3D objects better.
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
CV and Pattern Recognition
Teaches self-driving cars to see without 3D maps.
Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation
CV and Pattern Recognition
Helps computers map forests from 3D scans faster.