Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification
By: Jianxun Yu, Ruiquan Ge, Zhipeng Wang and more
Potential Business Impact:
Finds tiny sickness spots doctors miss.
Medical diagnosis faces challenges such as the misdiagnosis of small lesions. Deep learning, particularly multimodal approaches, has shown great potential for disease diagnosis. However, the difference in dimensionality between medical imaging and electronic health record (EHR) data makes effective alignment and fusion difficult. To address these issues, we propose the Multimodal Multiscale Cross-Attention Fusion Network (MMCAF-Net). The model combines a feature pyramid structure with an efficient 3D multi-scale convolutional attention module to extract lesion-specific features from 3D medical images. To further improve multimodal data integration, MMCAF-Net incorporates a multi-scale cross-attention module that resolves the dimensional inconsistencies and enables more effective feature fusion. We evaluated MMCAF-Net on the Lung-PET-CT-Dx dataset, where it showed a significant improvement in diagnostic accuracy, surpassing current state-of-the-art methods. The code is available at https://github.com/yjx1234/MMCAF-Net
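The fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the linked repository for that); it only shows, under stated assumptions, how cross-attention can combine an EHR vector with image feature tokens whose channel counts differ from scale to scale. The assumptions are: the EHR record is a single numeric vector used as the attention query, each image scale is a list of feature tokens, and random matrices stand in for learned projections. All function names (`fuse_multiscale`, `cross_attend`, and the helpers) are hypothetical.

```python
import math
import random


def matvec(w, x):
    """Multiply matrix w (rows x cols) by vector x (length cols)."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def random_matrix(rows, cols, rng):
    """Stand-in for a learned projection; weights are random, not trained."""
    scale = 1.0 / math.sqrt(cols)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(query, tokens, w_k, w_v):
    """Attend from one query vector over a list of token vectors."""
    d = len(query)
    keys = [matvec(w_k, t) for t in tokens]
    values = [matvec(w_v, t) for t in tokens]
    # Scaled dot-product score between the query and every key.
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    # Weighted sum of the value vectors.
    fused = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(d)]
    return fused, weights


def fuse_multiscale(ehr, image_scales, shared_dim=8, seed=0):
    """Project both modalities into one shared dimension, attend per scale,
    and concatenate the per-scale results."""
    rng = random.Random(seed)
    # Project the EHR vector to the shared dimension to form the query.
    query = matvec(random_matrix(shared_dim, len(ehr), rng), ehr)
    fused_all, weights_all = [], []
    for tokens in image_scales:
        in_dim = len(tokens[0])  # channel count may differ per scale
        w_k = random_matrix(shared_dim, in_dim, rng)
        w_v = random_matrix(shared_dim, in_dim, rng)
        fused, weights = cross_attend(query, tokens, w_k, w_v)
        fused_all.extend(fused)
        weights_all.append(weights)
    return fused_all, weights_all
```

Projecting each modality and each scale into one shared dimension before computing attention is what lets inputs of mismatched size be compared at all. The attention weights then indicate which image tokens the EHR query emphasizes at each scale. A trained model would learn the projections rather than sample them.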
Similar Papers
Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation
CV and Pattern Recognition
Finds skin cancer spots more accurately.
MSAD-Net: Multiscale and Spatial Attention-based Dense Network for Lung Cancer Classification
CV and Pattern Recognition
Finds lung cancer in scans better than before.
Towards a Generalizable Fusion Architecture for Multimodal Object Detection
CV and Pattern Recognition
Helps cameras see better in fog and dark.