
Effective Damage Data Generation by Fusing Imagery with Human Knowledge Using Vision-Language Models

Published: August 2, 2025 | arXiv ID: 2508.01380v1

By: Jie Wei, Erika Ardiles-Cruz, Aleksey Panasyuk and more

Potential Business Impact:

Helps rescue teams quickly see disaster damage.

Prompt and accurate damage assessment is crucial in humanitarian assistance and disaster response (HADR). Current deep learning approaches struggle to generalize because of class imbalance, the scarcity of moderate-damage examples, and human inaccuracy in pixel labeling during HADR situations. To address these limitations, state-of-the-art vision-language models (VLMs) can be used to fuse imagery with human knowledge, which offers an opportunity to generate a diverse set of image-based damage data effectively. Our initial experimental results suggest encouraging data generation quality and show an improvement in classifying scenes with different levels of structural damage to buildings, roads, and infrastructure.

Automated Wildfire Damage Assessment from Multi-view Ground-level Imagery Via Vision Language Models

CV and Pattern Recognition

Helps quickly see fire damage from pictures.

2 Sep 2025 0

91%

Structural Damage Detection Using AI Super Resolution and Visual Language Model

CV and Pattern Recognition

Helps drones see damage after disasters.

23 Aug 2025 0

90%

DisasterInsight: A Multimodal Benchmark for Function-Aware and Grounded Disaster Assessment

CV and Pattern Recognition

Helps computers understand disaster damage from pictures.

26 Jan 2026 1


Country of Origin

🇺🇸 United States

Page Count

6 pages
