Score: 1

When Swin Transformer Meets KANs: An Improved Transformer Architecture for Medical Image Segmentation

Published: November 6, 2025 | arXiv ID: 2511.04084v1

By: Nishchal Sapkota , Haoyan Shi , Yejia Zhang and more

Potential Business Impact:

Helps doctors see inside bodies better with less data.

Business Areas:

Image Recognition Data and Analytics, Software

Medical image segmentation is critical for accurate diagnostics and treatment planning, but remains challenging due to complex anatomical structures and limited annotated training data. CNN-based segmentation methods excel at local feature extraction, but struggle with modeling long-range dependencies. Transformers, on the other hand, capture global context more effectively, but are inherently data-hungry and computationally expensive. In this work, we introduce UKAST, a U-Net like architecture that integrates rational-function based Kolmogorov-Arnold Networks (KANs) into Swin Transformer encoders. By leveraging rational base functions and Group Rational KANs (GR-KANs) from the Kolmogorov-Arnold Transformer (KAT), our architecture addresses the inefficiencies of vanilla spline-based KANs, yielding a more expressive and data-efficient framework with reduced FLOPs and only a very small increase in parameter count compared to SwinUNETR. UKAST achieves state-of-the-art performance on four diverse 2D and 3D medical image segmentation benchmarks, consistently surpassing both CNN- and Transformer-based baselines. Notably, it attains superior accuracy in data-scarce settings, alleviating the data-hungry limitations of standard Vision Transformers. These results show the potential of KAN-enhanced Transformers to advance data-efficient medical image segmentation. Code is available at: https://github.com/nsapkota417/UKAST

GroupKAN: Rethinking Nonlinearity with Grouped Spline-based KAN Modeling for Efficient Medical Image Segmentation

CV and Pattern Recognition

Helps doctors see inside bodies better, faster.

7 Nov 2025 1

89%

FunKAN: Functional Kolmogorov-Arnold Network for Medical Image Enhancement and Segmentation

CV and Pattern Recognition

Makes medical pictures clearer and finds diseases.

16 Sep 2025 2

89%

ViKANformer: Embedding Kolmogorov Arnold Networks in Vision Transformers for Pattern-Based Learning

CV and Pattern Recognition

Makes computer vision smarter by learning better.

3 Mar 2025 0

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

5 pages

When Swin Transformer Meets KANs: An Improved Transformer Architecture for Medical Image Segmentation

Helps doctors see inside bodies better with less data.

Technical Abstract

GroupKAN: Rethinking Nonlinearity with Grouped Spline-based KAN Modeling for Efficient Medical Image Segmentation

FunKAN: Functional Kolmogorov-Arnold Network for Medical Image Enhancement and Segmentation

ViKANformer: Embedding Kolmogorov Arnold Networks in Vision Transformers for Pattern-Based Learning