Automated Extraction of Protocol State Machines from 3GPP Specifications with Domain-Informed Prompts and LLM Ensembles
By: Miao Zhang , Runhan Feng , Hongbo Tang and more
Potential Business Impact:
Builds phone network rules automatically from text.
Mobile telecommunication networks are foundational to global infrastructure and increasingly support critical sectors such as manufacturing, transportation, and healthcare. The security and reliability of these networks are essential, yet depend heavily on accurate modeling of underlying protocols through state machines. While most prior work constructs such models manually from 3GPP specifications, this process is labor-intensive, error-prone, and difficult to maintain due to the complexity and frequent updates of the specifications. Recent efforts using natural language processing have shown promise, but remain limited in handling the scale and intricacy of cellular protocols. In this work, we propose SpecGPT, a novel framework that leverages large language models (LLMs) to automatically extract protocol state machines from 3GPP documents. SpecGPT segments technical specifications into meaningful paragraphs, applies domain-informed prompting with chain-of-thought reasoning, and employs ensemble methods to enhance output reliability. We evaluate SpecGPT on three representative 5G protocols (NAS, NGAP, and PFCP) using manually annotated ground truth, and show that it outperforms existing approaches, demonstrating the effectiveness of LLMs for protocol modeling at scale.
Similar Papers
Can Large Language Models Automate the Refinement of Cellular Network Specifications?
Cryptography and Security
Finds phone network security flaws automatically.
Synthesizing Precise Protocol Specs from Natural Language for Effective Test Generation
Software Engineering
Makes computer rules easy to test automatically.
Mobile Network-specialized Large Language Models for 6G: Architectures, Innovations, Challenges, and Future Trends
Networking and Internet Architecture
AI helps future phones work better automatically.