Score: 1

Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics

Published: September 10, 2025 | arXiv ID: 2509.08461v1

By: Dikshant Sagar , Kaiwen Yu , Alejandro Yankelevich and more

Potential Business Impact:

Helps scientists find tiny particles in pictures.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Recent advances in Large Language Models (LLMs) have demonstrated their remarkable capacity to process and reason over structured and unstructured data modalities beyond natural language. In this work, we explore the applications of Vision Language Models (VLMs), specifically a fine-tuned variant of LLaMa 3.2, to the task of identifying neutrino interactions in pixelated detector data from high-energy physics (HEP) experiments. We benchmark this model against a state-of-the-art convolutional neural network (CNN) architecture, similar to those used in the NOvA and DUNE experiments, which have achieved high efficiency and purity in classifying electron and muon neutrino events. Our evaluation considers both the classification performance and interpretability of the model predictions. We find that VLMs can outperform CNNs, while also providing greater flexibility in integrating auxiliary textual or semantic information and offering more interpretable, reasoning-based predictions. This work highlights the potential of VLMs as a general-purpose backbone for physics event classification, due to their high performance, interpretability, and generalizability, which opens new avenues for integrating multimodal reasoning in experimental neutrino physics.

Page Count
19 pages

Category
Computer Science:
Machine Learning (CS)