Improving Diagnostic Performance on Small and Imbalanced Datasets Using Class-Based Input Image Composition
By: Hlali Azzeddine, Majid Ben Yakhlef, Soulaiman El Hazzat
Potential Business Impact:
Improves disease detection from blurry eye scans.
Small, imbalanced datasets and poor input image quality can lead to high false predictions rates with deep learning models. This paper introduces Class-Based Image Composition, an approach that allows us to reformulate training inputs through a fusion of multiple images of the same class into combined visual composites, named Composite Input Images (CoImg). That enhances the intra-class variance and improves the valuable information density per training sample and increases the ability of the model to distinguish between subtle disease patterns. Our method was evaluated on the Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods (OCTDL) (Kulyabin et al., 2024), which contains 2,064 high-resolution optical coherence tomography (OCT) scans of the human retina, representing seven distinct diseases with a significant class imbalance. We constructed a perfectly class-balanced version of this dataset, named Co-OCTDL, where each scan is resented as a 3x1 layout composite image. To assess the effectiveness of this new representation, we conducted a comparative analysis between the original dataset and its variant using a VGG16 model. A fair comparison was ensured by utilizing the identical model architecture and hyperparameters for all experiments. The proposed approach markedly improved diagnostic results.The enhanced Dataset achieved near-perfect accuracy (99.6%) with F1-score (0.995) and AUC (0.9996), compared to a baseline model trained on raw dataset. The false prediction rate was also significantly lower, this demonstrates that the method can producehigh-quality predictions even for weak datasets affected by class imbalance or small sample size.
Similar Papers
Improving Diagnostic Performance on Small and Imbalanced Datasets Using Class-Based Input Image Composition
CV and Pattern Recognition
Makes AI better at spotting diseases in eye scans.
MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
CV and Pattern Recognition
Makes AI combine many pictures into one.
Instance-Level Composed Image Retrieval
CV and Pattern Recognition
Helps computers find specific objects in pictures.