The Role of Noisy Data in Improving CNN Robustness for Image Classification
By: Oscar H. Ramírez-Agudelo , Nicoleta Gorea , Aliza Reif and more
Data quality plays a central role in the performance and robustness of convolutional neural networks (CNNs) for image classification. While high-quality data is often preferred for training, real-world inputs are frequently affected by noise and other distortions. This paper investigates the effect of deliberately introducing controlled noise into the training data to improve model robustness. Using the CIFAR-10 dataset, we evaluate the impact of three common corruptions, namely Gaussian noise, Salt-and-Pepper noise, and Gaussian blur at varying intensities and training set pollution levels. Experiments using a Resnet-18 model reveal that incorporating just 10\% noisy data during training is sufficient to significantly reduce test loss and enhance accuracy under fully corrupted test conditions, with minimal impact on clean-data performance. These findings suggest that strategic exposure to noise can act as a simple yet effective regularizer, offering a practical trade-off between traditional data cleanliness and real-world resilience.
Similar Papers
Noisy Label Refinement with Semantically Reliable Synthetic Images
CV and Pattern Recognition
Fixes computer vision mistakes using fake pictures.
On the Role of Label Noise in the Feature Learning Process
Machine Learning (Stat)
Helps computers learn better even with wrong answers.
Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions
CV and Pattern Recognition
Makes smart cameras work better in bad weather.