LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery
By: Zixuan Xiao, Jun Ma
Potential Business Impact:
Spots changes in pictures using smart computer vision.
Existing change detection methods often lack the versatility to handle diverse real-world queries and the intelligence for comprehensive analysis. This paper presents a general agent framework, integrating Large Language Models (LLM) with vision foundation models to form ChangeGPT. A hierarchical structure is employed to mitigate hallucination. The agent was evaluated on a curated dataset of 140 questions categorized by real-world scenarios, encompassing various question types (e.g., Size, Class, Number) and complexities. The evaluation assessed the agent's tool selection ability (Precision/Recall) and overall query accuracy (Match). ChangeGPT, especially with a GPT-4-turbo backend, demonstrated superior performance, achieving a 90.71 % Match rate. Its strength lies particularly in handling change-related queries requiring multi-step reasoning and robust tool selection. Practical effectiveness was further validated through a real-world urban change monitoring case study in Qianhai Bay, Shenzhen. By providing intelligence, adaptability, and multi-type change analysis, ChangeGPT offers a powerful solution for decision-making in remote sensing applications.
Similar Papers
Vision-Language Agents for Interactive Forest Change Analysis
CV and Pattern Recognition
Helps scientists understand forest changes by asking questions.
Multi-Agent Multimodal Large Language Model Framework for Automated Interpretation of Fuel Efficiency Analytics in Public Transportation
Artificial Intelligence
Helps buses use less fuel by explaining data.
Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response
Artificial Intelligence
Helps robots plan better for fires and floods.