Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels

Published: August 20, 2025 | arXiv ID: 2508.14767v1

By: Fabian Holst, Emre Gülsoylu, Simone Frintrop

Potential Business Impact:

Helps boats know their exact position and direction.

Business Areas:

Image Recognition, Data and Analytics, Software

We present a novel technique for creating a 6D pose estimation dataset for marine vessels by fusing monocular RGB images with Automatic Identification System (AIS) data. The technique addresses the limitations of relying purely on AIS for location information, which stem from issues such as equipment reliability, data manipulation, and transmission delays. By combining AIS messages with vessel detections from monocular RGB images, obtained with an object detection network (YOLOX-X), it generates 3D bounding boxes that represent the vessels' 6D poses, i.e., three translational and three rotational degrees of freedom. We evaluate several object detection models for locating vessels in image space and compare two transformation methods, homography and Perspective-n-Point (PnP), for aligning AIS data with image coordinates. The PnP method achieves a significantly lower projection error than the homography-based approaches used previously, and the YOLOX-X model achieves a mean Average Precision (mAP) of 0.80 at an Intersection over Union (IoU) threshold of 0.5 for the relevant vessel classes. These results indicate that our approach allows a 6D pose estimation dataset to be created without manual annotation. Additionally, we introduce Boats on Nordelbe Kehrwieder (BONK-pose), a publicly available dataset of 3753 images with 3D bounding box annotations for pose estimation, created with our data fusion approach. It can be used for training and evaluating 6D pose estimation networks. We also introduce a set of 1000 images with 2D bounding box annotations for ship detection from the same scene.
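
To make the idea of a 3D bounding box derived from AIS data more concrete, the following is a minimal sketch, not the authors' implementation. It assumes the AIS position has already been converted to a local east-north-up frame in metres, that heading is measured clockwise from north, and that the camera intrinsics `K` and pose `R`, `t` are already known (for example, from a PnP calibration step). The function names `box_corners` and `project`, the box height, and all numeric values are hypothetical.

```python
import numpy as np

def box_corners(center_xy, heading_deg, length, width, height):
    """Return the 8 corners (8x3) of a vessel box in a local
    east-north-up frame, in metres. Hypothetical helper."""
    # Assumption: heading is measured clockwise from north.
    theta = np.deg2rad(heading_deg)
    forward = np.array([np.sin(theta), np.cos(theta), 0.0])  # bow direction
    right = np.array([np.cos(theta), -np.sin(theta), 0.0])   # starboard direction
    up = np.array([0.0, 0.0, 1.0])
    # Assumption: the box sits on the water surface at z = 0.
    base = np.array([center_xy[0], center_xy[1], 0.0])
    corners = []
    for f in (-0.5, 0.5):          # stern / bow
        for r in (-0.5, 0.5):      # port / starboard
            for u in (0.0, 1.0):   # waterline / top
                corners.append(base + f * length * forward
                               + r * width * right
                               + u * height * up)
    return np.array(corners)

def project(points_world, K, R, t):
    """Pinhole projection x ~ K (R X + t); returns Nx2 pixel coordinates."""
    cam = points_world @ R.T + t       # world frame -> camera frame
    uvw = cam @ K.T                    # apply intrinsics
    return uvw[:, :2] / uvw[:, 2:3]    # perspective divide

# Illustrative values only: a 40 m x 8 m x 6 m vessel heading due east.
corners = box_corners((10.0, 20.0), 90.0, 40.0, 8.0, 6.0)
```

Projecting the eight corners with `project` yields the image-space outline of the 3D box. The quality of that outline depends entirely on how well `K`, `R`, and `t` describe the real camera, which is where the choice between homography and PnP matters.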

Enhancing Maritime Domain Awareness on Inland Waterways: A YOLO-Based Fusion of Satellite and AIS for Vessel Characterization

CV and Pattern Recognition

Spots hidden boats on rivers using satellites.

13 Oct 2025

Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation

CV and Pattern Recognition

Helps boats see better in fog and storms.

2 May 2025

Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection

Image and Video Processing

Spots ships in ocean pictures better.

17 Mar 2025

Page Count

13 pages
