Multi-Attention Stacked Ensemble for Lung Cancer Detection in CT Scans
By: Uzzal Saha, Surya Prakash
Potential Business Impact:
Helps radiologists classify lung nodules in CT scans as benign or malignant more accurately.
In this work, we address the challenge of binary lung nodule classification (benign vs. malignant) using CT images by proposing a multi-level attention stacked ensemble of deep neural networks. Three pretrained backbones -- EfficientNet V2 S, MobileViT XXS, and DenseNet201 -- are each adapted with a custom classification head tailored to 96 x 96 pixel inputs. A two-stage attention mechanism learns both model-wise and class-wise importance scores from concatenated logits, and a lightweight meta-learner refines the final prediction. To mitigate class imbalance and improve generalization, we employ dynamic focal loss with empirically calculated class weights, MixUp augmentation during training, and test-time augmentation at inference. Experiments on the LIDC-IDRI dataset demonstrate exceptional performance, achieving 98.09% accuracy and 0.9961 AUC, representing a 35 percent reduction in error rate compared to state-of-the-art methods. The model exhibits balanced performance across sensitivity (98.73%) and specificity (98.96%), with particularly strong results on challenging cases where radiologist disagreement was high. Statistical significance testing confirms the robustness of these improvements across multiple experimental runs. Our approach can serve as a robust, automated aid for radiologists in lung cancer screening.
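The abstract names three training and inference techniques (class-weighted focal loss, MixUp, and test-time augmentation) without giving implementation details. The following is only a minimal pure-Python sketch of the textbook forms of these techniques, not the authors' implementation. All function names, the inverse-frequency weighting scheme, and the default values of gamma and alpha are assumptions made for illustration; the paper's "dynamic" focal loss may differ.

```python
import math
import random


def inverse_frequency_weights(labels):
    """Per-class weights n / (2 * count_c) for binary labels (assumed scheme)."""
    n = len(labels)
    counts = [labels.count(0), labels.count(1)]
    return tuple(n / (2 * c) for c in counts)


def focal_loss(p_malignant, label, class_weights=(0.5, 0.5), gamma=2.0):
    """Standard binary focal loss for one sample.

    p_malignant: predicted probability of class 1 (malignant).
    label: 0 (benign) or 1 (malignant).
    gamma: focusing parameter; gamma=0 gives weighted cross-entropy.
    """
    eps = 1e-12
    p_t = p_malignant if label == 1 else 1.0 - p_malignant
    p_t = min(max(p_t, eps), 1.0)  # guard against log(0)
    return -class_weights[label] * (1.0 - p_t) ** gamma * math.log(p_t)


def mixup(x_a, y_a, x_b, y_b, alpha=0.2, rng=random):
    """Blend two samples and their labels with lam ~ Beta(alpha, alpha)."""
    lam = rng.betavariate(alpha, alpha)
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = lam * y_a + (1.0 - lam) * y_b
    return x, y, lam


def hflip(image):
    """Horizontally flip an image stored as a list of rows."""
    return [row[::-1] for row in image]


def tta_predict(predict, image, transforms):
    """Average a model's predicted probability over augmented views."""
    probs = [predict(t(image)) for t in transforms]
    return sum(probs) / len(probs)
```

In this sketch, a confidently correct prediction (p_t near 1) contributes far less loss than an uncertain one, which is the property focal loss relies on to emphasize hard examples. A real pipeline would apply the same ideas to batched tensors in a deep learning framework.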
Similar Papers
Advanced Deep Learning Techniques for Accurate Lung Cancer Detection and Classification
Image and Video Processing
Finds lung cancer in scans with high accuracy.
A Hybrid Deep Learning Framework with Explainable AI for Lung Cancer Classification with DenseNet169 and SVM
CV and Pattern Recognition
Finds lung cancer on scans faster, more accurately.
MSAD-Net: Multiscale and Spatial Attention-based Dense Network for Lung Cancer Classification
CV and Pattern Recognition
Finds lung cancer in scans better than before.