Deep Attention-guided Adaptive Subsampling
By: Sharath M Shankaranarayana , Soumava Kumar Roy , Prasad Sudhakar and more
Potential Business Impact:
Makes smart computers work faster and cheaper.
Although deep neural networks have provided impressive gains in performance, these improvements often come at the cost of increased computational complexity and expense. In many cases, such as 3D volume or video classification tasks, not all slices or frames are necessary due to inherent redundancies. To address this issue, we propose a novel learnable subsampling framework that can be integrated into any neural network architecture. Subsampling, being a nondifferentiable operation, poses significant challenges for direct adaptation into deep learning models. While some works, have proposed solutions using the Gumbel-max trick to overcome the problem of non-differentiability, they fall short in a crucial aspect: they are only task-adaptive and not inputadaptive. Once the sampling mechanism is learned, it remains static and does not adjust to different inputs, making it unsuitable for real-world applications. To this end, we propose an attention-guided sampling module that adapts to inputs even during inference. This dynamic adaptation results in performance gains and reduces complexity in deep neural network models. We demonstrate the effectiveness of our method on 3D medical imaging datasets from MedMNIST3D as well as two ultrasound video datasets for classification tasks, one of them being a challenging in-house dataset collected under real-world clinical conditions.
Similar Papers
Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus
CV and Pattern Recognition
Finds hidden eye disease earlier in scans.
DRL-Guided Neural Batch Sampling for Semi-Supervised Pixel-Level Anomaly Detection
CV and Pattern Recognition
Finds tiny flaws in factory products using smart learning.
IntelliCap: Intelligent Guidance for Consistent View Sampling
CV and Pattern Recognition
Guides cameras to take perfect pictures for 3D scenes.