SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection
By: Yao Wang , Dong Yang , Zhi Qiao and more
Potential Business Impact:
Finds sickness in medical pictures faster.
Abnormality detection in medical imaging is a critical task requiring both high efficiency and accuracy to support effective diagnosis. While convolutional neural networks (CNNs) and Transformer-based models are widely used, both face intrinsic challenges: CNNs have limited receptive fields, restricting their ability to capture broad contextual information, and Transformers encounter prohibitive computational costs when processing high-resolution medical images. Mamba, a recent innovation in natural language processing, has gained attention for its ability to process long sequences with linear complexity, offering a promising alternative. Building on this foundation, we present SpectMamba, the first Mamba-based architecture designed for medical image detection. A key component of SpectMamba is the Hybrid Spatial-Frequency Attention (HSFA) block, which separately learns high- and low-frequency features. This approach effectively mitigates the loss of high-frequency information caused by frequency bias and correlates frequency-domain features with spatial features, thereby enhancing the model's ability to capture global context. To further improve long-range dependencies, we propose the Visual State-Space Module (VSSM) and introduce a novel Hilbert Curve Scanning technique to strengthen spatial correlations and local dependencies, further optimizing the Mamba framework. Comprehensive experiments show that SpectMamba achieves state-of-the-art performance while being both effective and efficient across various medical image detection tasks.
Similar Papers
Versatile and Efficient Medical Image Super-Resolution Via Frequency-Gated Mamba
CV and Pattern Recognition
Makes blurry medical pictures sharp for doctors.
SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation
CV and Pattern Recognition
Helps doctors see tiny details in medical scans.
Hyperspectral Mamba for Hyperspectral Object Tracking
CV and Pattern Recognition
Tracks objects better using special light colors.