Score: 2

Toward Open Earth Science as Fast and Accessible as Natural Language

Published: May 21, 2025 | arXiv ID: 2505.15690v2

By: Marquita Ellis , Iksha Gurung , Muthukumaran Ramasubramanian and more

BigTech Affiliations: IBM

Potential Business Impact:

Lets computers understand and analyze Earth pictures.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Is natural-language-driven earth observation data analysis now feasible with the assistance of Large Language Models (LLMs)? For open science in service of public interest, feasibility requires reliably high accuracy, interactive latencies, low (sustainable) costs, open LLMs, and openly maintainable software -- hence, the challenge. What are the techniques and programming system requirements necessary for satisfying these constraints, and what is the corresponding development and maintenance burden in practice? This study lays the groundwork for exploring these questions, introducing an impactful earth science use-case, and providing a software framework with evaluation data and metrics, along with initial results from employing model scaling, prompt-optimization, and inference-time scaling optimization techniques. While we attain high accuracy (near 100%) across 10 of 11 metrics, the analysis further considers cost (token-spend), latency, and maintainability across this space of techniques. Finally, we enumerate opportunities for further research, general programming and evaluation framework development, and ongoing work for a comprehensive, deployable solution. This is a call for collaboration and contribution.

Country of Origin
πŸ‡ΊπŸ‡Έ United States

Repos / Data Links

Page Count
14 pages

Category
Computer Science:
Computational Engineering, Finance, and Science