Score: 1

Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition

Published: December 5, 2025 | arXiv ID: 2512.05936v1

By: Anne Sielemann , Lena Loercher , Max-Lion Schumacher and more

Potential Business Impact:

Makes self-driving cars better at seeing signs.

Business Areas:
Image Recognition Data and Analytics, Software

In this paper, we present a synthesis pipeline and dataset for training / testing data in the task of traffic sign recognition that combines the advantages of data-driven and analytical modeling: GAN-based texture generation enables data-driven dirt and wear artifacts, rendering unique and realistic traffic sign surfaces, while the analytical scene modulation achieves physically correct lighting and allows detailed parameterization. In particular, the latter opens up applications in the context of explainable AI (XAI) and robustness tests due to the possibility of evaluating the sensitivity to parameter changes, which we demonstrate with experiments. Our resulting synthetic traffic sign recognition dataset Synset Signset Germany contains a total of 105500 images of 211 different German traffic sign classes, including newly published (2020) and thus comparatively rare traffic signs. In addition to a mask and a segmentation image, we also provide extensive metadata including the stochastically selected environment and imaging effect parameters for each image. We evaluate the degree of realism of Synset Signset Germany on the real-world German Traffic Sign Recognition Benchmark (GTSRB) and in comparison to CATERED, a state-of-the-art synthetic traffic sign recognition dataset.

Page Count
8 pages

Category
Computer Science:
CV and Pattern Recognition