Dual Interaction Network with Cross-Image Attention for Medical Image Segmentation
By: Jeonghyun Noh, Wangsu Jeon, Jinsun Park
Potential Business Impact:
Improves medical scans for better disease detection.
Medical image segmentation is a crucial method for assisting professionals in diagnosing various diseases through medical imaging. However, various factors such as noise, blurriness, and low contrast often hinder the accurate diagnosis of diseases. While numerous image enhancement techniques can mitigate these issues, they may also alter crucial information needed for accurate diagnosis in the original image. Conventional image fusion strategies, such as feature concatenation can address this challenge. However, they struggle to fully leverage the advantages of both original and enhanced images while suppressing the side effects of the enhancements. To overcome the problem, we propose a dual interactive fusion module (DIFM) that effectively exploits mutual complementary information from the original and enhanced images. DIFM employs cross-attention bidirectionally to simultaneously attend to corresponding spatial information across different images, subsequently refining the complementary features via global spatial attention. This interaction leverages low- to high-level features implicitly associated with diverse structural attributes like edges, blobs, and object shapes, resulting in enhanced features that embody important spatial characteristics. In addition, we introduce a multi-scale boundary loss based on gradient extraction to improve segmentation accuracy at object boundaries. Experimental results on the ACDC and Synapse datasets demonstrate the superiority of the proposed method quantitatively and qualitatively. Code available at: https://github.com/JJeong-Gari/DIN
Similar Papers
DM-FNet: Unified multimodal medical image fusion via diffusion process-trained encoder-decoder
CV and Pattern Recognition
Makes medical scans clearer for better diagnoses.
DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification
CV and Pattern Recognition
Helps doctors find eye sickness by comparing both eyes.
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
Image and Video Processing
Helps doctors see diseases in X-rays better.