Score: 0

Application Of Large Language Models For The Extraction Of Information From Particle Accelerator Technical Documentation

Published: September 2, 2025 | arXiv ID: 2509.02227v1

By: Qing Dai , Rasmus Ischebeck , Maruisz Sapinski and more

Potential Business Impact:

Saves old machine secrets before experts leave.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

The large set of technical documentation of legacy accelerator systems, coupled with the retirement of experienced personnel, underscores the urgent need for efficient methods to preserve and transfer specialized knowledge. This paper explores the application of large language models (LLMs), to automate and enhance the extraction of information from particle accelerator technical documents. By exploiting LLMs, we aim to address the challenges of knowledge retention, enabling the retrieval of domain expertise embedded in legacy documentation. We present initial results of adapting LLMs to this specialized domain. Our evaluation demonstrates the effectiveness of LLMs in extracting, summarizing, and organizing knowledge, significantly reducing the risk of losing valuable insights as personnel retire. Furthermore, we discuss the limitations of current LLMs, such as interpretability and handling of rare domain-specific terms, and propose strategies for improvement. This work highlights the potential of LLMs to play a pivotal role in preserving institutional knowledge and ensuring continuity in highly specialized fields.

Page Count
4 pages

Category
Computer Science:
Information Retrieval