Score: 2

Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective

Published: May 7, 2025 | arXiv ID: 2505.04758v1

By: Songsong Duan , Xi Yang , Nannan Wang and more

Potential Business Impact:

Makes computers see important things faster.

Business Areas:

Image Recognition Data and Analytics, Software

Current RGB-D methods usually leverage large-scale backbones to improve accuracy but sacrifice efficiency. Meanwhile, several existing lightweight methods are difficult to achieve high-precision performance. To balance the efficiency and performance, we propose a Speed-Accuracy Tradeoff Network (SATNet) for Lightweight RGB-D SOD from three fundamental perspectives: depth quality, modality fusion, and feature representation. Concerning depth quality, we introduce the Depth Anything Model to generate high-quality depth maps,which effectively alleviates the multi-modal gaps in the current datasets. For modality fusion, we propose a Decoupled Attention Module (DAM) to explore the consistency within and between modalities. Here, the multi-modal features are decoupled into dual-view feature vectors to project discriminable information of feature maps. For feature representation, we develop a Dual Information Representation Module (DIRM) with a bi-directional inverted framework to enlarge the limited feature space generated by the lightweight backbones. DIRM models texture features and saliency features to enrich feature space, and employ two-way prediction heads to optimal its parameters through a bi-directional backpropagation. Finally, we design a Dual Feature Aggregation Module (DFAM) in the decoder to aggregate texture and saliency features. Extensive experiments on five public RGB-D SOD datasets indicate that the proposed SATNet excels state-of-the-art (SOTA) CNN-based heavyweight models and achieves a lightweight framework with 5.2 M parameters and 415 FPS.

SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images

CV and Pattern Recognition

Helps robots see important things in pictures.

4 Mar 2025 1

89%

DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection

CV and Pattern Recognition

Finds important things in pictures faster.

24 Nov 2025 2

88%

SAM-DAQ: Segment Anything Model with Depth-guided Adaptive Queries for RGB-D Video Salient Object Detection

CV and Pattern Recognition

Helps computers find moving objects in videos.

13 Nov 2025 2

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

15 pages

Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective

Makes computers see important things faster.

Technical Abstract

SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images

DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection

SAM-DAQ: Segment Anything Model with Depth-guided Adaptive Queries for RGB-D Video Salient Object Detection