Panda: Self-distillation of Reusable Sensor-level Representations for High Energy Physics
By: Samuel Young, Kazuhiro Terao
Potential Business Impact:
Teaches computers to see tiny particle paths.
Liquid argon time projection chambers (LArTPCs) provide dense, high-fidelity 3D measurements of particle interactions and underpin current and future neutrino and rare-event experiments. Physics reconstruction typically relies on complex detector-specific pipelines that use tens of hand-engineered pattern recognition algorithms or cascades of task-specific neural networks that require extensive, labeled simulation that requires a careful, time-consuming calibration process. We introduce \textbf{Panda}, a model that learns reusable sensor-level representations directly from raw unlabeled LArTPC data. Panda couples a hierarchical sparse 3D encoder with a multi-view, prototype-based self-distillation objective. On a simulated dataset, Panda substantially improves label efficiency and reconstruction quality, beating the previous state-of-the-art semantic segmentation model with 1,000$\times$ fewer labels. We also show that a single set-prediction head 1/20th the size of the backbone with no physical priors trained on frozen outputs from Panda can result in particle identification that is comparable with state-of-the-art (SOTA) reconstruction tools. Full fine-tuning further improves performance across all tasks.
Similar Papers
Particle Hit Clustering and Identification Using Point Set Transformers in Liquid Argon Time Projection Chambers
High Energy Physics - Experiment
Finds tiny particles faster and uses less computer power.
Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs
CV and Pattern Recognition
Improves robot vision in dark or far away.
An Evaluation of Representation Learning Methods in Particle Physics Foundation Models
Machine Learning (CS)
Teaches computers to understand tiny particles better.