
From Retrieval to Reasoning: A Framework for Cyber Threat Intelligence NER with Explicit and Adaptive Instructions

Published: December 22, 2025 | arXiv ID: 2512.19414v1

By: Jiaren Peng, Hongda Sun, Xuan Tian and more

Potential Business Impact:

Helps computers find bad guys' secrets faster.

Business Areas:

Semantic Search Internet Services

The automation of Cyber Threat Intelligence (CTI) relies heavily on Named Entity Recognition (NER) to extract critical entities from unstructured text. Currently, Large Language Models (LLMs) primarily address this task through retrieval-based In-Context Learning (ICL). This paper analyzes this mainstream paradigm, revealing a fundamental flaw: its success stems not from global semantic similarity but largely from the incidental overlap of entity types within retrieved examples. This exposes the limitations of relying on unreliable implicit induction. To address this, we propose TTPrompt, a framework shifting from implicit induction to explicit instruction. TTPrompt maps the core concepts of CTI's Tactics, Techniques, and Procedures (TTPs) into an instruction hierarchy: formulating task definitions as Tactics, guiding strategies as Techniques, and annotation guidelines as Procedures. Furthermore, to handle the adaptability challenge of static guidelines, we introduce Feedback-driven Instruction Refinement (FIR). FIR enables LLMs to self-refine guidelines by learning from errors on minimal labeled data, adapting to distinct annotation dialects. Experiments on five CTI NER benchmarks demonstrate that TTPrompt consistently surpasses retrieval-based baselines. Notably, with refinement on just 1% of training data, it rivals models fine-tuned on the full dataset. For instance, on LADDER, its Micro F1 of 71.96% approaches the fine-tuned baseline, and on the complex CTINexus, its Macro F1 exceeds the fine-tuned ACLM model by 10.91%.
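The instruction hierarchy and refinement loop described in the abstract can be pictured with a minimal sketch. This is not the authors' implementation: the class `TTPInstructions`, the functions `build_prompt` and `refine`, the prompt layout, and the two callables `predict` and `propose_guideline` (stand-ins for LLM calls) are all hypothetical names chosen for illustration. The sketch assumes only what the abstract states: task definitions act as Tactics, guiding strategies as Techniques, annotation guidelines as Procedures, and the guidelines are revised from errors on a small labeled sample.

```python
from dataclasses import dataclass, field, replace
from typing import Callable, List, Tuple

Entity = Tuple[str, str]  # (text span, entity type)


@dataclass
class TTPInstructions:
    """Hypothetical container for the three instruction levels."""
    tactic: str                                           # task definition
    techniques: List[str]                                 # guiding strategies
    procedures: List[str] = field(default_factory=list)   # annotation guidelines


def build_prompt(instr: TTPInstructions, text: str) -> str:
    """Render the hierarchy plus the input text as one prompt (layout is an assumption)."""
    lines = ["Tactic (task definition):", instr.tactic, ""]
    lines.append("Techniques (guiding strategies):")
    lines += [f"- {t}" for t in instr.techniques]
    lines += ["", "Procedures (annotation guidelines):"]
    lines += [f"- {p}" for p in instr.procedures]
    lines += ["", "Input:", text]
    return "\n".join(lines)


def refine(
    instr: TTPInstructions,
    labeled: List[Tuple[str, List[Entity]]],
    predict: Callable[[str], List[Entity]],
    propose_guideline: Callable[[List[str], list], str],
    max_rounds: int = 3,
) -> TTPInstructions:
    """Feedback loop: collect errors on a small labeled sample, ask for one new guideline.

    `predict` and `propose_guideline` are placeholders for LLM calls; their
    signatures are assumptions, not an API from the paper.
    """
    current = replace(instr, procedures=list(instr.procedures))  # leave the input untouched
    for _ in range(max_rounds):
        errors = []
        for text, gold in labeled:
            pred = set(predict(build_prompt(current, text)))
            gold_set = set(gold)
            if pred != gold_set:
                # Record missed and spurious entities for this example.
                errors.append((text, sorted(gold_set - pred), sorted(pred - gold_set)))
        if not errors:
            break
        new_rule = propose_guideline(current.procedures, errors)
        if new_rule in current.procedures:
            break  # no new guideline proposed, stop early
        current.procedures.append(new_rule)
    return current
```

In practice both callables would wrap the same LLM: one to tag entities under the current instructions, one to turn the observed errors into a revised guideline. Keeping them as parameters lets the loop be exercised with simple stubs before any model is attached.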

Advancing Autonomous Incident Response: Leveraging LLMs and Cyber Threat Intelligence

Cryptography and Security

Helps computers fight cyber threats faster.

14 Aug 2025

90%

Retrieval augmented generation based dynamic prompting for few-shot biomedical named entity recognition using large language models

Computation and Language

Helps computers understand medical notes better.

25 Jul 2025

90%

Towards a scalable AI-driven framework for data-independent Cyber Threat Intelligence Information Extraction

Cryptography and Security

Finds computer attack clues without needing examples.

8 Jan 2025


Country of Origin

🇨🇳 China

Page Count

13 pages
