Score: 0

TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image

Published: March 17, 2025 | arXiv ID: 2503.12779v1

By: Haoxiao Wang , Kaichen Zhou , Binrui Gu and more

Potential Business Impact:

Helps robots grab see-through objects accurately.

Business Areas:

3D Printing Manufacturing

Manipulating transparent objects presents significant challenges due to the complexities introduced by their reflection and refraction properties, which considerably hinder the accurate estimation of their 3D shapes. To address these challenges, we propose a single-view RGB-D-based depth completion framework, TransDiff, that leverages the Denoising Diffusion Probabilistic Models(DDPM) to achieve material-agnostic object grasping in desktop. Specifically, we leverage features extracted from RGB images, including semantic segmentation, edge maps, and normal maps, to condition the depth map generation process. Our method learns an iterative denoising process that transforms a random depth distribution into a depth map, guided by initially refined depth information, ensuring more accurate depth estimation in scenarios involving transparent objects. Additionally, we propose a novel training method to better align the noisy depth and RGB image features, which are used as conditions to refine depth estimation step by step. Finally, we utilized an improved inference process to accelerate the denoising procedure. Through comprehensive experimental validation, we demonstrate that our method significantly outperforms the baselines in both synthetic and real-world benchmarks with acceptable inference time. The demo of our method can be found on https://wang-haoxiao.github.io/TransDiff/

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

CV and Pattern Recognition

Helps robots see through glass and reflections.

29 Dec 2025 1

89%

TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update

Robotics

Shows 3D shape of see-through things from pictures.

15 Jul 2025 1

88%

PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes

CV and Pattern Recognition

Makes computers understand pictures better without special cameras.

24 Mar 2025 1

View PDF Login to Bookmark

Page Count

7 pages

TransDiff: Diffusion-Based Method for Manipulating Transparent Objects Using a Single RGB-D Image

Helps robots grab see-through objects accurately.

Technical Abstract

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update

PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes