BERTector: An Intrusion Detection Framework Constructed via Joint-dataset Learning Based on Language Model

Published: August 14, 2025 | arXiv ID: 2508.10327v2

By: Haoyang Hu, Xun Huang, Chenyu Wu and more

Potential Business Impact:

Finds hidden computer threats better.

Intrusion detection systems (IDS) are widely used to maintain the stability of network environments, but their generalizability remains limited by the heterogeneity of network traffic. In this work, we propose BERTector, a new joint-dataset learning framework for IDS based on BERT. BERTector integrates three key components: NSS-Tokenizer for traffic-aware semantic tokenization, supervised fine-tuning with a hybrid dataset, and low-rank adaptation for efficient fine-tuning. Experiments show that BERTector achieves state-of-the-art detection accuracy, strong generalizability, and excellent robustness. BERTector achieves the highest accuracy of 99.28% on NSL-KDD and reaches an average detection success rate of 80% against four perturbations. These results establish a unified and efficient solution for modern IDS in complex and dynamic network environments.
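
The abstract names low-rank adaptation as the component that keeps fine-tuning efficient but gives no implementation details. The sketch below illustrates only the general low-rank adaptation idea, under stated assumptions: a frozen weight matrix W is augmented with a trainable update (alpha / r) * B * A, where A and B are small rank-r factors. The class name `LoRALinear`, the layer sizes, the rank, and the initialization are hypothetical choices for illustration and are not taken from BERTector.

```python
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


class LoRALinear:
    """Frozen weight W plus a trainable low-rank update (alpha / r) * B @ A.

    Hypothetical illustration of generic low-rank adaptation; not the
    BERTector implementation.
    """

    def __init__(self, d_in, d_out, r, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # Frozen "pretrained" weight (random here, purely for illustration).
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Trainable factors: A starts random and B starts at zero, so the
        # adapted layer initially behaves exactly like the frozen layer.
        self.A = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(r)]
        self.B = [[0.0] * r for _ in range(d_out)]
        self.scale = alpha / r

    def forward(self, x):
        base = matvec(self.W, x)                    # frozen path: W x
        update = matvec(self.B, matvec(self.A, x))  # low-rank path: B (A x)
        return [b + self.scale * u for b, u in zip(base, update)]

    def trainable_params(self):
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])

    def frozen_params(self):
        return len(self.W) * len(self.W[0])


layer = LoRALinear(d_in=64, d_out=64, r=4)
print(layer.trainable_params(), layer.frozen_params())  # 512 4096
```

With these assumed sizes, only 512 of the layer's parameters would be updated during fine-tuning while the 4096 original weights stay frozen, which is the source of the efficiency that low-rank adaptation is generally used for.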

Country of Origin
🇨🇳 China

Page Count
5 pages

Category
Computer Science:
Cryptography and Security