Score: 2

Multi-Agent Monocular Dense SLAM With 3D Reconstruction Priors

Published: November 24, 2025 | arXiv ID: 2511.19031v1

By: Haihang Wu, Yuchen Zhou

BigTech Affiliations: Weibo

Potential Business Impact:

Robots map spaces faster with multiple cameras.

Business Areas:
Robotics Hardware, Science and Engineering, Software

Monocular Simultaneous Localization and Mapping (SLAM) aims to estimate a robot's pose while simultaneously reconstructing an unknown 3D scene using a single camera. While existing monocular SLAM systems generate detailed 3D geometry through dense scene representations, they are computationally expensive due to the need for iterative optimization. To address this challenge, MASt3R-SLAM utilizes learned 3D reconstruction priors, enabling more efficient and accurate estimation of both 3D structures and camera poses. However, MASt3R-SLAM is limited to single-agent operation. In this paper, we extend MASt3R-SLAM to introduce the first multi-agent monocular dense SLAM system. Each agent performs local SLAM using a 3D reconstruction prior, and their individual maps are fused into a globally consistent map through a loop-closure-based map fusion mechanism. Our approach improves computational efficiency compared to state-of-the-art methods, while maintaining similar mapping accuracy when evaluated on real-world datasets.

Country of Origin
🇨🇳 China

Page Count
6 pages

Category
Computer Science:
Robotics