Score: 0

Masked Feature Modeling Enhances Adaptive Segmentation

Published: September 17, 2025 | arXiv ID: 2509.13801v1

By: Wenlve Zhou , Zhiheng Zhou , Tiantao Xian and more

Potential Business Impact:

Teaches computers to see in new places.

Business Areas:

Image Recognition Data and Analytics, Software

Unsupervised domain adaptation (UDA) for semantic segmentation aims to transfer models from a labeled source domain to an unlabeled target domain. While auxiliary self-supervised tasks-particularly contrastive learning-have improved feature discriminability, masked modeling approaches remain underexplored in this setting, largely due to architectural incompatibility and misaligned optimization objectives. We propose Masked Feature Modeling (MFM), a novel auxiliary task that performs feature masking and reconstruction directly in the feature space. Unlike existing masked modeling methods that reconstruct low-level inputs or perceptual features (e.g., HOG or visual tokens), MFM aligns its learning target with the main segmentation task, ensuring compatibility with standard architectures like DeepLab and DAFormer without modifying the inference pipeline. To facilitate effective reconstruction, we introduce a lightweight auxiliary module, Rebuilder, which is trained jointly but discarded during inference, adding zero computational overhead at test time. Crucially, MFM leverages the segmentation decoder to classify the reconstructed features, tightly coupling the auxiliary objective with the pixel-wise prediction task to avoid interference with the primary task. Extensive experiments across various architectures and UDA benchmarks demonstrate that MFM consistently enhances segmentation performance, offering a simple, efficient, and generalizable strategy for unsupervised domain-adaptive semantic segmentation.

OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation

CV and Pattern Recognition

Helps computers see in new places without new labels.

13 Dec 2025 1

89%

VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation

CV and Pattern Recognition

Helps computers learn from pictures better with less data.

11 Mar 2025 2

89%

MFM-DA: Instance-Aware Adaptor and Hierarchical Alignment for Efficient Domain Adaptation in Medical Foundation Models

CV and Pattern Recognition

Helps AI doctors see eye problems better.

2 Mar 2025 1

View PDF Login to Bookmark

Page Count

9 pages

Masked Feature Modeling Enhances Adaptive Segmentation

Teaches computers to see in new places.

Technical Abstract

OMUDA: Omni-level Masking for Unsupervised Domain Adaptation in Semantic Segmentation

VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation

MFM-DA: Instance-Aware Adaptor and Hierarchical Alignment for Efficient Domain Adaptation in Medical Foundation Models