MMUEChange: A Generalized LLM Agent Framework for Intelligent Multi-Modal Urban Environment Change Analysis
By: Zixuan Xiao, Jun Ma, Siwei Zhang
Potential Business Impact:
Finds city changes from different data sources.
Understanding urban environment change is essential for sustainable development. However, current approaches, particularly remote sensing change detection, often rely on rigid, single-modal analysis. To overcome these limitations, we propose MMUEChange, a multi-modal agent framework that flexibly integrates heterogeneous urban data via a modular toolkit and a core module, Modality Controller for cross- and intra-modal alignment, enabling robust analysis of complex urban change scenarios. Case studies include: a shift toward small, community-focused parks in New York, reflecting local green space efforts; the spread of concentrated water pollution across districts in Hong Kong, pointing to coordinated water management; and a notable decline in open dumpsites in Shenzhen, with contrasting links between nighttime economic activity and waste types, indicating differing urban pressures behind domestic and construction waste. Compared to the best-performing baseline, the MMUEChange agent achieves a 46.7% improvement in task success rate and effectively mitigates hallucination, demonstrating its capacity to support complex urban change analysis tasks with real-world policy implications.
Similar Papers
LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery
Artificial Intelligence
Spots changes in pictures using smart computer vision.
Urban-MAS: Human-Centered Urban Prediction with LLM-Based Multi-Agent System
Multiagent Systems
Helps city computers predict crowds and traffic.
Multi-Agent Multimodal Large Language Model Framework for Automated Interpretation of Fuel Efficiency Analytics in Public Transportation
Artificial Intelligence
Helps buses use less fuel by explaining data.