Score: 3

SA-UNetv2: Rethinking Spatial Attention U-Net for Retinal Vessel Segmentation

Published: September 15, 2025 | arXiv ID: 2509.11774v1

By: Changlu Guo , Anders Nymark Christensen , Anders Bjorholm Dahl and more

Potential Business Impact:

Finds eye diseases faster and more easily.

Business Areas:
Image Recognition Data and Analytics, Software

Retinal vessel segmentation is essential for early diagnosis of diseases such as diabetic retinopathy, hypertension, and neurodegenerative disorders. Although SA-UNet introduces spatial attention in the bottleneck, it underuses attention in skip connections and does not address the severe foreground-background imbalance. We propose SA-UNetv2, a lightweight model that injects cross-scale spatial attention into all skip connections to strengthen multi-scale feature fusion and adopts a weighted Binary Cross-Entropy (BCE) plus Matthews Correlation Coefficient (MCC) loss to improve robustness to class imbalance. On the public DRIVE and STARE datasets, SA-UNetv2 achieves state-of-the-art performance with only 1.2MB memory and 0.26M parameters (less than 50% of SA-UNet), and 1 second CPU inference on 592 x 592 x 3 images, demonstrating strong efficiency and deployability in resource-constrained, CPU-only settings.

Country of Origin
🇨🇳 🇩🇰 Denmark, China

Repos / Data Links

Page Count
5 pages

Category
Computer Science:
CV and Pattern Recognition