
Attention-Enhanced Prototypical Learning for Few-Shot Infrastructure Defect Segmentation

Published: October 6, 2025 | arXiv ID: 2510.05266v1

By: Christina Thrainer, Md Meftahul Ferdaus, Mahdi Abdelguerfi and more

Potential Business Impact:

Finds pipe problems with few pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Few-shot semantic segmentation is vital for deep learning-based infrastructure inspection, where labeled training examples are scarce and expensive. Existing deep learning frameworks perform well, but they require extensive labeled datasets and cannot learn new defect categories from little data. We present an Enhanced Feature Pyramid Network (E-FPN) framework for few-shot semantic segmentation of culvert and sewer defect categories, built on prototypical learning. Our approach makes three main contributions: (1) an adaptive E-FPN encoder that uses InceptionSepConv blocks and depth-wise separable convolutions for efficient multi-scale feature extraction; (2) prototypical learning with masked average pooling for effective prototype generation from a small number of support examples; and (3) attention-based feature representation through global self-attention, local self-attention, and cross-attention. Experiments on challenging infrastructure inspection datasets show strong few-shot performance: the best configuration, trained 8-way 5-shot, reaches 82.55% F1-score and 72.26% mIoU in 2-way classification testing. Self-attention yielded the largest improvement, a gain of 2.57% F1-score and 2.9% mIoU over baselines. The framework addresses the critical need to respond rapidly to new defect types in infrastructure inspection systems with limited new training data, which supports more efficient and economical maintenance planning for critical infrastructure.
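To illustrate the prototypical learning step named in the abstract, here is a minimal NumPy sketch of masked average pooling followed by nearest-prototype labeling. It is not the authors' implementation: the function names, the (C, H, W) feature layout, and the use of cosine similarity for matching are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def masked_average_pooling(features, mask, eps=1e-8):
    """Pool a (C, H, W) feature map over the pixels selected by a binary (H, W) mask.

    Returns a (C,) prototype vector: the mean feature over masked pixels.
    """
    mask = mask.astype(features.dtype)
    weighted_sum = (features * mask[None, :, :]).sum(axis=(1, 2))
    return weighted_sum / (mask.sum() + eps)

def class_prototype(support_features, support_masks):
    """Average the per-image prototypes of the K support shots for one class."""
    protos = [masked_average_pooling(f, m)
              for f, m in zip(support_features, support_masks)]
    return np.mean(protos, axis=0)

def segment_by_nearest_prototype(query_features, prototypes, eps=1e-8):
    """Label each query pixel with the index of its most similar prototype.

    query_features: (C, H, W); prototypes: (N, C).
    Cosine similarity is an assumed choice of metric.
    """
    c, h, w = query_features.shape
    q = query_features.reshape(c, -1)                          # (C, H*W)
    q = q / (np.linalg.norm(q, axis=0, keepdims=True) + eps)   # unit-norm pixels
    p = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + eps)
    similarity = p @ q                                         # (N, H*W)
    return similarity.argmax(axis=0).reshape(h, w)
```

In an N-way K-shot episode, one would call `class_prototype` once per class with that class's K support feature maps and masks, stack the results into an (N, C) array, and pass it to `segment_by_nearest_prototype` together with the query features produced by the encoder.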

FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images

CV and Pattern Recognition

Helps computers identify objects in murky underwater pictures.

1 Apr 2025 1

87%

Deep Learning Framework for Infrastructure Maintenance: Crack Detection and High-Resolution Imaging of Infrastructure Surfaces

CV and Pattern Recognition

Makes drone pictures of bridges clearer for repairs.

6 May 2025 0

87%

Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation

Image and Video Processing

Finds eye problems by looking at blood vessels.

2 Jun 2025 0


Country of Origin

🇺🇸 United States

Page Count

10 pages
