Score: 0

PoolNet: Deep Learning for 2D to 3D Video Process Validation

Published: December 5, 2025 | arXiv ID: 2512.05362v1

By: Sanchit Kaul, Joseph Luna, Shray Arora

Lifting Structure-from-Motion (SfM) information from sequential and non-sequential image data is a time-consuming and computationally expensive task. In addition to this, the majority of publicly available data is unfit for processing due to inadequate camera pose variation, obscuring scene elements, and noisy data. To solve this problem, we introduce PoolNet, a versatile deep learning framework for frame-level and scene-level validation of in-the-wild data. We demonstrate that our model successfully differentiates SfM ready scenes from those unfit for processing while significantly undercutting the amount of time state of the art algorithms take to obtain structure-from-motion data.

3D Reconstruction via Incremental Structure From Motion

CV and Pattern Recognition

Builds 3D maps from scattered pictures.

1 Aug 2025 0

89%

CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes

CV and Pattern Recognition

Helps robots find their way from the sky.

3 Aug 2025 1

88%

Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction

CV and Pattern Recognition

Makes 3D pictures from photos for VR.

2 Sep 2025 0

View PDF Login to Bookmark

PoolNet: Deep Learning for 2D to 3D Video Process Validation

Technical Abstract

3D Reconstruction via Incremental Structure From Motion

CVD-SfM: A Cross-View Deep Front-end Structure-from-Motion System for Sparse Localization in Multi-Altitude Scenes

Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction