Score: 1

A Satellite-Ground Synergistic Large Vision-Language Model System for Earth Observation

Published: July 8, 2025 | arXiv ID: 2507.05731v1

By: Yuxin Zhang , Jiahao Yang , Zhe Chen and more

Potential Business Impact:

Helps satellites see Earth faster, saving data.

Business Areas:

GPS Hardware, Navigation and Mapping

Recently, large vision-language models (LVLMs) unleash powerful analysis capabilities for low Earth orbit (LEO) satellite Earth observation images in the data center. However, fast satellite motion, brief satellite-ground station (GS) contact windows, and large size of the images pose a data download challenge. To enable near real-time Earth observation applications (e.g., disaster and extreme weather monitoring), we should explore how to deploy LVLM in LEO satellite networks, and design SpaceVerse, an efficient satellite-ground synergistic LVLM inference system. To this end, firstly, we deploy compact LVLMs on satellites for lightweight tasks, whereas regular LVLMs operate on GSs to handle computationally intensive tasks. Then, we propose a computing and communication co-design framework comprised of a progressive confidence network and an attention-based multi-scale preprocessing, used to identify on-satellite inferring data, and reduce data redundancy before satellite-GS transmission, separately. We implement and evaluate SpaceVerse on real-world LEO satellite constellations and datasets, achieving a 31.2% average gain in accuracy and a 51.2% reduction in latency compared to state-of-the-art baselines.

Enabling Near-realtime Remote Sensing via Satellite-Ground Collaboration of Large Vision-Language Models

Networking and Internet Architecture

Lets satellites understand pictures faster.

28 Oct 2025 1

91%

SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

CV and Pattern Recognition

Finds things in satellite pictures using words.

9 Dec 2025 1

90%

GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing

CV and Pattern Recognition

Helps computers understand satellite pictures better.

16 Mar 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

11 pages

A Satellite-Ground Synergistic Large Vision-Language Model System for Earth Observation

Helps satellites see Earth faster, saving data.

Technical Abstract

Enabling Near-realtime Remote Sensing via Satellite-Ground Collaboration of Large Vision-Language Models

SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing