Score: 3

Subimage Overlap Prediction: Task-Aligned Self-Supervised Pretraining For Semantic Segmentation In Remote Sensing Imagery

Published: January 5, 2026 | arXiv ID: 2601.01781v1

By: Lakshay Sharma, Alex Marin

BigTech Affiliations: University of Washington

Potential Business Impact:

Teaches computers to understand pictures with less data.

Business Areas:

Image Recognition Data and Analytics, Software

Self-supervised learning (SSL) methods have become a dominant paradigm for creating general purpose models whose capabilities can be transferred to downstream supervised learning tasks. However, most such methods rely on vast amounts of pretraining data. This work introduces Subimage Overlap Prediction, a novel self-supervised pretraining task to aid semantic segmentation in remote sensing imagery that uses significantly lesser pretraining imagery. Given an image, a sub-image is extracted and the model is trained to produce a semantic mask of the location of the extracted sub-image within the original image. We demonstrate that pretraining with this task results in significantly faster convergence, and equal or better performance (measured via mIoU) on downstream segmentation. This gap in convergence and performance widens when labeled training data is reduced. We show this across multiple architecture types, and with multiple downstream datasets. We also show that our method matches or exceeds performance while requiring significantly lesser pretraining data relative to other SSL methods. Code and model weights are provided at \href{https://github.com/sharmalakshay93/subimage-overlap-prediction}{github.com/sharmalakshay93/subimage-overlap-prediction}.

Cross-Scale Pretraining: Enhancing Self-Supervised Learning for Low-Resolution Satellite Imagery for Semantic Segmentation

CV and Pattern Recognition

Makes satellite pictures clearer for better maps.

19 Jan 2026 0

89%

Scale-Aware Self-Supervised Learning for Segmentation of Small and Sparse Structures

CV and Pattern Recognition

Helps computers see tiny things in pictures.

26 Jan 2026 2

89%

Is Self-Supervised Pre-training on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-2

CV and Pattern Recognition

Earth pictures train computers better than cat pictures.

15 Feb 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Repos / Data Links

github.com

Page Count

10 pages

Subimage Overlap Prediction: Task-Aligned Self-Supervised Pretraining For Semantic Segmentation In Remote Sensing Imagery

Teaches computers to understand pictures with less data.

Technical Abstract

Cross-Scale Pretraining: Enhancing Self-Supervised Learning for Low-Resolution Satellite Imagery for Semantic Segmentation

Scale-Aware Self-Supervised Learning for Segmentation of Small and Sparse Structures

Is Self-Supervised Pre-training on Satellite Imagery Better than ImageNet? A Systematic Study with Sentinel-2