Training a Custom CNN on Five Heterogeneous Image Datasets

Published: January 8, 2026 | arXiv ID: 2601.04727v1

By: Anika Tabassum, Tasnuva Mahazabin Tuba, Nafisa Naznin

Potential Business Impact:

Helps computers identify things in pictures better.

Business Areas:

Image Recognition, Data and Analytics, Software

Deep learning has transformed visual data analysis, with Convolutional Neural Networks (CNNs) becoming highly effective at learning meaningful feature representations directly from images. Unlike traditional manual feature engineering methods, CNNs automatically extract hierarchical visual patterns, enabling strong performance across diverse real-world contexts. This study investigates the effectiveness of CNN-based architectures across five heterogeneous datasets spanning agricultural and urban domains: mango variety classification, paddy variety identification, road surface condition assessment, auto-rickshaw detection, and footpath encroachment monitoring. These datasets introduce varying challenges, including differences in illumination, resolution, environmental complexity, and class imbalance, necessitating adaptable and robust learning models. We evaluate a lightweight, task-specific custom CNN alongside established deep architectures, including ResNet-18 and VGG-16, trained both from scratch and using transfer learning. Through systematic preprocessing, augmentation, and controlled experimentation, we analyze how architectural complexity, model depth, and pre-training influence convergence, generalization, and performance across datasets of differing scale and difficulty. The key contributions of this work are: (1) the development of an efficient custom CNN that achieves competitive performance across multiple application domains, and (2) a comprehensive comparative analysis highlighting when transfer learning and deep architectures provide substantial advantages, particularly in data-constrained environments. These findings offer practical insights for deploying deep learning models in resource-limited yet high-impact real-world visual classification tasks.
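
To make the notion of "architectural complexity" concrete, the sketch below counts trainable parameters for VGG-16 and for a small three-layer CNN. The VGG-16 layer widths are the standard configuration with a 1000-class ImageNet head. The small network (`SMALL_CONVS`, `SMALL_FC`, five output classes, global average pooling before the classifier) is a hypothetical stand-in, not the custom CNN from the paper, whose layer layout is not given on this page. Helper names are illustrative, and batch-normalization parameters are ignored.

```python
def conv_params(c_in, c_out, k=3):
    """Weights plus biases of a k x k convolution layer."""
    return k * k * c_in * c_out + c_out


def linear_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return n_in * n_out + n_out


def count_params(conv_channels, fc_sizes, in_channels=3):
    """Total trainable parameters for a plain conv stack followed by an MLP head.

    conv_channels: output channels of each 3x3 conv layer, in order.
    fc_sizes: sizes of the head, starting with the flattened input size.
    Pooling layers carry no parameters, so they are not listed.
    """
    total = 0
    c_in = in_channels
    for c_out in conv_channels:
        total += conv_params(c_in, c_out)
        c_in = c_out
    for n_in, n_out in zip(fc_sizes, fc_sizes[1:]):
        total += linear_params(n_in, n_out)
    return total


# Standard VGG-16: 13 conv layers, 224x224 input reduced to 7x7 by five poolings.
VGG16_CONVS = [64, 64, 128, 128, 256, 256, 256, 512, 512, 512, 512, 512, 512]
VGG16_FC = [512 * 7 * 7, 4096, 4096, 1000]

# Hypothetical lightweight CNN (assumption, not the paper's architecture):
# three conv layers, global average pooling, then a 5-class linear layer.
SMALL_CONVS = [16, 32, 64]
SMALL_FC = [64, 5]

vgg_total = count_params(VGG16_CONVS, VGG16_FC)
small_total = count_params(SMALL_CONVS, SMALL_FC)

print(f"VGG-16 parameters:    {vgg_total:,}")  # 138,357,544
print(f"Small CNN parameters: {small_total:,}")
print(f"Ratio: {vgg_total / small_total:.0f}x")
```

Most of VGG-16's parameters sit in its fully connected head, which is one reason a compact network with global average pooling can be so much smaller. Parameter count alone does not predict accuracy; it only indicates the memory and compute gap that matters in resource-limited deployments.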

A Comparative Study of Custom CNNs, Pre-trained Models, and Transfer Learning Across Multiple Visual Datasets

CV and Pattern Recognition

Transfer learning finds best image matches.

5 Jan 2026

Comparative Analysis of Custom CNN Architectures versus Pre-trained Models and Transfer Learning: A Study on Five Bangladesh Datasets

CV and Pattern Recognition

Teaches computers to spot road damage perfectly.

7 Jan 2026

Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks

CV and Pattern Recognition

Builds smarter computer vision for different tasks.

3 Jan 2026

Country of Origin

🇧🇩 Bangladesh

Page Count

24 pages
