ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining
By: Zhenyang Huang , Xiao Yu , Yi Zhang and more
Remote sensing image change detection is one of the fundamental tasks in remote sensing intelligent interpretation. Its core objective is to identify changes within change regions of interest (CRoI). Current multimodal large models encode rich human semantic knowledge, which is utilized for guidance in tasks such as remote sensing change detection. However, existing methods that use semantic guidance for detecting users' CRoI overly rely on explicit textual descriptions of CRoI, leading to the problem of near-complete performance failure when presented with implicit CRoI textual descriptions. This paper proposes a multimodal reasoning change detection model named ReasonCD, capable of mining users' implicit task intent. The model leverages the powerful reasoning capabilities of pre-trained large language models to mine users' implicit task intents and subsequently obtains different change detection results based on these intents. Experiments on public datasets demonstrate that the model achieves excellent change detection performance, with an F1 score of 92.1\% on the BCDD dataset. Furthermore, to validate its superior reasoning functionality, this paper annotates a subset of reasoning data based on the SECOND dataset. Experimental results show that the model not only excels at basic reasoning-based change detection tasks but can also explain the reasoning process to aid human decision-making.
Similar Papers
Referring Change Detection in Remote Sensing Imagery
CV and Pattern Recognition
Finds specific changes in pictures using words.
Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting
CV and Pattern Recognition
Finds what changed in satellite pictures.
CSD: Change Semantic Detection with only Semantic Change Masks for Damage Assessment in Conflict Zones
CV and Pattern Recognition
Spots building damage from space quickly.