Synthetic-to-Real Object Detection using YOLOv11 and Domain Randomization Strategies
By: Luisa Torquato Niño, Hamza A. A. Gardi
Potential Business Impact:
Teaches computers to see real things using fake pictures.
This paper addresses the synthetic-to-real domain gap in object detection, focusing on training a YOLOv11 model to detect a specific object (a soup can) using only synthetic data and domain randomization strategies. The methodology involves extensive experimentation with data augmentation, dataset composition, and model scaling. While synthetic validation metrics were consistently high, they proved to be poor predictors of real-world performance. Consequently, models were also evaluated qualitatively, through visual inspection of predictions, and quantitatively, on a manually labeled real-world test set, to guide development. Final mAP@50 scores were provided by the official Kaggle competition. Key findings indicate that increasing synthetic dataset diversity, specifically by including varied perspectives and complex backgrounds, combined with carefully tuned data augmentation, were crucial in bridging the domain gap. The best performing configuration, a YOLOv11l model trained on an expanded and diverse dataset, achieved a final mAP@50 of 0.910 on the competition's hidden test set. This result demonstrates the potential of a synthetic-only training approach while also highlighting the remaining challenges in fully capturing real-world variability.
Similar Papers
Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study
CV and Pattern Recognition
Teaches robots to see and grab parts.
A Synthetic Dataset for Manometry Recognition in Robotic Applications
CV and Pattern Recognition
Creates fake pictures to train robots for dangerous jobs.
Exploring Syn-to-Real Domain Adaptation for Military Target Detection
CV and Pattern Recognition
Makes cameras find military targets in new places.