SA-UNetv2: Rethinking Spatial Attention U-Net for Retinal Vessel Segmentation
By: Changlu Guo , Anders Nymark Christensen , Anders Bjorholm Dahl and more
Potential Business Impact:
Finds eye diseases faster and more easily.
Retinal vessel segmentation is essential for early diagnosis of diseases such as diabetic retinopathy, hypertension, and neurodegenerative disorders. Although SA-UNet introduces spatial attention in the bottleneck, it underuses attention in skip connections and does not address the severe foreground-background imbalance. We propose SA-UNetv2, a lightweight model that injects cross-scale spatial attention into all skip connections to strengthen multi-scale feature fusion and adopts a weighted Binary Cross-Entropy (BCE) plus Matthews Correlation Coefficient (MCC) loss to improve robustness to class imbalance. On the public DRIVE and STARE datasets, SA-UNetv2 achieves state-of-the-art performance with only 1.2MB memory and 0.26M parameters (less than 50% of SA-UNet), and 1 second CPU inference on 592 x 592 x 3 images, demonstrating strong efficiency and deployability in resource-constrained, CPU-only settings.
Similar Papers
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
Image and Video Processing
Finds eye problems by looking at blood vessels.
DB-KAUNet: An Adaptive Dual Branch Kolmogorov-Arnold UNet for Retinal Vessel Segmentation
CV and Pattern Recognition
Finds tiny eye blood vessels for better health checks.
NeuroVascU-Net: A Unified Multi-Scale and Cross-Domain Adaptive Feature Fusion U-Net for Precise 3D Segmentation of Brain Vessels in Contrast-Enhanced T1 MRI
CV and Pattern Recognition
Helps doctors see brain blood vessels better.