FuseUNet: A Multi-Scale Feature Fusion Method for U-like Networks
By: Quansong He , Xiangde Min , Kaishen Wang and more
Potential Business Impact:
Helps doctors see inside bodies better.
Medical image segmentation is a critical task in computer vision, with UNet serving as a milestone architecture. The typical component of UNet family is the skip connection, however, their skip connections face two significant limitations: (1) they lack effective interaction between features at different scales, and (2) they rely on simple concatenation or addition operations, which constrain efficient information integration. While recent improvements to UNet have focused on enhancing encoder and decoder capabilities, these limitations remain overlooked. To overcome these challenges, we propose a novel multi-scale feature fusion method that reimagines the UNet decoding process as solving an initial value problem (IVP), treating skip connections as discrete nodes. By leveraging principles from the linear multistep method, we propose an adaptive ordinary differential equation method to enable effective multi-scale feature fusion. Our approach is independent of the encoder and decoder architectures, making it adaptable to various U-Net-like networks. Experiments on ACDC, KiTS2023, MSD brain tumor, and ISIC2017/2018 skin lesion segmentation datasets demonstrate improved feature utilization, reduced network parameters, and maintained high performance. The code is available at https://github.com/nayutayuki/FuseUNet.
Similar Papers
Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation
Image and Video Processing
Finds eye problems by looking at blood vessels.
Simple is what you need for efficient and accurate medical image segmentation
Image and Video Processing
Makes medical scans clearer with tiny computer programs.
Multi-Granularity Vision Fastformer with Fusion Mechanism for Skin Lesion Segmentation
CV and Pattern Recognition
Helps doctors find skin spots faster and cheaper.