Score: 2

Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks

Published: November 17, 2025 | arXiv ID: 2511.13145v1

By: Cesar Portocarrero Rodriguez, Laura Vandeweyen, Yosuke Yamamoto

BigTech Affiliations: Stanford University

Potential Business Impact:

Finds road cracks faster using smart computer eyes.

Business Areas:

Image Recognition Data and Analytics, Software

The American Society of Civil Engineers has graded Americas infrastructure condition as a C, with the road system receiving a dismal D. Roads are vital to regional economic viability, yet their management, maintenance, and repair processes remain inefficient, relying on outdated manual or laser-based inspection methods that are both costly and time-consuming. With the increasing availability of real-time visual data from autonomous vehicles, there is an opportunity to apply computer vision (CV) methods for advanced road monitoring, providing insights to guide infrastructure rehabilitation efforts. This project explores the use of state-of-the-art CV techniques for road distress segmentation. It begins by evaluating synthetic data generated with Generative Adversarial Networks (GANs) to assess its usefulness for model training. The study then applies Convolutional Neural Networks (CNNs) for road distress segmentation and subsequently examines the transformer-based model MaskFormer. Results show that GAN-generated data improves model performance and that MaskFormer outperforms the CNN model in two metrics: mAP50 and IoU.