Score: 0

Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification

Published: January 12, 2026 | arXiv ID: 2601.07163v1

By: Shu Shen, C. L. Philip Chen, Tong Zhang

Reliable learning on low-quality multimodal data is a widely concerning issue, especially in safety-critical applications. However, multimodal noise poses a major challenge in this domain and leads existing methods to suffer from two key limitations. First, they struggle to reliably remove heterogeneous data noise, hindering robust multimodal representation learning. Second, they exhibit limited adaptability and generalization when encountering previously unseen noise. To address these issues, we propose Test-time Adaptive Hierarchical Co-enhanced Denoising Network (TAHCD). On one hand, TAHCD introduces the Adaptive Stable Subspace Alignment and Sample-Adaptive Confidence Alignment to reliably remove heterogeneous noise. They account for noise at both global and instance levels and enable jointly removal of modality-specific and cross-modality noise, achieving robust learning. On the other hand, TAHCD introduces test-time cooperative enhancement, which adaptively updates the model in response to input noise in a label-free manner, improving adaptability and generalization. This is achieved by collaboratively enhancing the joint removal process of modality-specific and cross-modality noise across global and instance levels according to sample noise. Experiments on multiple benchmarks demonstrate that the proposed method achieves superior classification performance, robustness, and generalization compared with state-of-the-art reliable multimodal learning approaches.

Open-World Test-Time Adaptation with Hierarchical Feature Aggregation and Attention Affine

CV and Pattern Recognition

Helps AI tell real from fake, even when surprised.

16 Nov 2025 0

87%

Beyond Distribution Shifts: Adaptive Hyperspectral Image Classification at Test Time

CV and Pattern Recognition

Makes image analysis work even with bad pictures.

10 Sep 2025 1

87%

Test-Time Adaptation for Video Highlight Detection Using Meta-Auxiliary Learning and Cross-Modality Hallucinations

CV and Pattern Recognition

Makes video highlight finders work better on new videos.

6 Aug 2025 1

View PDF Login to Bookmark

Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification

Technical Abstract

Open-World Test-Time Adaptation with Hierarchical Feature Aggregation and Attention Affine

Beyond Distribution Shifts: Adaptive Hyperspectral Image Classification at Test Time

Test-Time Adaptation for Video Highlight Detection Using Meta-Auxiliary Learning and Cross-Modality Hallucinations