InferA: A Smart Assistant for Cosmological Ensemble Data
By: Justin Z. Tam, Pascal Grosset, Divya Banesh and more
Potential Business Impact:
Helps scientists analyze huge computer simulations faster.
Analyzing large-scale scientific datasets presents substantial challenges due to their sheer volume, structural complexity, and the need for specialized domain knowledge. Automation tools such as PandasAI typically require full data ingestion and lack context about the full data structure, making them impractical as intelligent data analysis assistants for terabyte-scale datasets. To overcome these limitations, we propose InferA, a multi-agent system that leverages large language models to enable scalable and efficient scientific data analysis. At the core of the architecture is a supervisor agent that orchestrates a team of specialized agents, each responsible for a distinct phase of data retrieval and analysis. The system engages interactively with users to elicit their analytical intent and confirm query objectives, ensuring alignment between user goals and system actions. To demonstrate the framework's usability, we evaluate the system on ensemble runs from the HACC cosmology simulation, which comprise several terabytes of data.
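The supervisor pattern described in the abstract can be illustrated with a minimal sketch. This is not InferA's implementation: the abstract does not specify the agents, their interfaces, or how the language model is called, so every name here (`Task`, `Supervisor`, `retrieval_agent`, `analysis_agent`, the `halo_mass` field, and the toy values) is a hypothetical stand-in, and plain Python functions take the place of LLM-backed agents. The sketch only shows the control flow: confirm the user's intent first, then pass the task through specialized agents in order.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Task:
    """Shared state passed between agents (hypothetical structure)."""
    query: str
    confirmed: bool = False
    results: Dict[str, object] = field(default_factory=dict)


def retrieval_agent(task: Task) -> None:
    # Stand-in for selecting only the relevant slice of a large dataset
    # instead of ingesting everything. The values are made-up toy data.
    task.results["rows"] = [{"halo_mass": m} for m in (1.0, 2.5, 4.0)]


def analysis_agent(task: Task) -> None:
    # Stand-in for an analysis step that consumes the retrieved rows.
    masses = [row["halo_mass"] for row in task.results["rows"]]
    task.results["mean_halo_mass"] = sum(masses) / len(masses)


class Supervisor:
    """Confirms the query with the user, then runs each agent in order."""

    def __init__(
        self,
        agents: List[Tuple[str, Callable[[Task], None]]],
        confirm: Callable[[str], bool],
    ) -> None:
        self.agents = agents
        self.confirm = confirm

    def run(self, query: str) -> Task:
        task = Task(query=query)
        task.confirmed = self.confirm(query)
        if not task.confirmed:
            # The user rejected the interpreted query, so no agent runs.
            return task
        for _name, agent in self.agents:
            agent(task)
        return task


pipeline = [("retrieval", retrieval_agent), ("analysis", analysis_agent)]

# An always-yes callback stands in for the interactive confirmation step.
accepted = Supervisor(pipeline, confirm=lambda q: True).run("mean halo mass")
print(accepted.results["mean_halo_mass"])

# When the user declines, the supervisor returns without running any agent.
declined = Supervisor(pipeline, confirm=lambda q: False).run("mean halo mass")
print(declined.results)
```

Running the confirmation before any agent executes mirrors the abstract's emphasis on aligning user goals with system actions. In a real system the callback would presumably be an interactive dialogue rather than a fixed answer.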
Similar Papers
Towards Efficient Agents: A Co-Design of Inference Architecture and System
Computation and Language
Makes AI agents think and act much faster.
The AI Cosmologist I: An Agentic System for Automated Data Analysis
Instrumentation and Methods for Astrophysics
AI discovers new space science by doing research.
AI Agents for Ground-Based Gamma Astronomy
Instrumentation and Methods for Astrophysics
AI helps scientists run telescopes and analyze space data.