Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study

Published: December 3, 2025 | arXiv ID: 2512.04031v1

By: Yixuan Li, Yuhao Lu, Yang Liu and more

Potential Business Impact:

Finds space signals better with less data.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

This work investigates whether large language models (LLMs) offer advantages over traditional neural networks for astronomical data processing in regimes with non-Gaussian, non-stationary noise and limited labeled samples. Gravitational wave observations provide a suitable test case: using only 90 LIGO events, fine-tuned LLMs achieve 97.4% accuracy in identifying signals. Further experiments show that, in contrast to traditional networks that rely on large simulated datasets, additional simulated samples do not improve LLM performance, while scaling studies reveal predictable gains with increasing model size and dataset size. These results indicate that LLMs can extract discriminative structure directly from observational data and provide an efficient assessment for gravitational wave identification. The same strategy may extend to other astronomical domains with similar noise properties, such as radio or pulsar observations.

Textual interpretation of transient image classifications from large language models

Instrumentation and Methods for Astrophysics

Helps find real space explosions in telescope pictures.

8 Oct 2025

89%

Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries

Computation and Language

Teaches computers to understand space science from text.

18 Nov 2025

88%

Can Large Language Models Help Experimental Design for Causal Discovery?

Artificial Intelligence

Lets computers find science answers faster.

3 Mar 2025

Country of Origin

🇨🇳 China

Page Count

11 pages
