The Procedural Semantics Gap in Structured CTI: A Measurement-Driven STIX Analysis for APT Emulation

Published: December 12, 2025 | arXiv ID: 2512.12078v1

By: Ágney Lopes Roth Ferraz, Sidnei Barbieri, Murray Evangelista de Souza and more

Potential Business Impact:

Makes computer attack plans work like video games.

Business Areas:

Penetration Testing, Information Technology, Privacy and Security

Cyber threat intelligence (CTI) encoded in STIX and structured according to the MITRE ATT&CK framework has become a global reference for describing adversary behavior. However, ATT&CK was designed as a descriptive knowledge base rather than a procedural model. We therefore ask whether its structured artifacts contain sufficient behavioral detail to support multi-stage adversary emulation. Through systematic measurements of the ATT&CK Enterprise bundle, we show that campaign objects encode just fragmented slices of behavior. Only 35.6% of techniques appear in at least one campaign, and neither clustering nor sequence analysis reveals any reusable behavioral structure under technique overlap or LCS-based analyses. Intrusion sets cover a broader portion of the technique space, yet omit the procedural semantics required to transform behavioral knowledge into executable chains, including ordering, preconditions, and environmental assumptions. These findings reveal a procedural semantic gap in current CTI standards: they describe what adversaries do, but not exactly how that behavior was operationalized. To assess how far this gap can be bridged in practice, we introduce a three-stage methodology that translates behavioral information from structured CTI into executable steps and makes the necessary environmental assumptions explicit. We demonstrate its viability by instantiating the resulting steps as operations in the MITRE Caldera framework. Case studies of ShadowRay and Soft Cell show that structured CTI can enable the emulation of multi-stage APT campaigns, but only when analyst-supplied parameters and assumptions are explicitly recorded. This, in turn, exposes the precise points at which current standards fail to support automation. Our results clarify the boundary between descriptive and machine-actionable CTI for adversary emulation.
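The coverage and sequence measurements described in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes an ATT&CK-style STIX bundle already loaded as a Python dict, in which techniques are `attack-pattern` objects and usage is expressed by `relationship` objects with `relationship_type` set to `uses`. The function names, the toy bundle, its shortened object IDs, and the example technique sequences are all hypothetical. STIX `uses` relationships carry no ordering, so the sequences passed to the LCS function are assumed to come from some analyst-chosen ordering.

```python
def technique_sets(bundle, source_type="campaign"):
    """Return (all active technique ids, {source id: set of technique ids used})."""
    objects = bundle.get("objects", [])
    # Keep only techniques that are neither revoked nor deprecated.
    techniques = {
        o["id"] for o in objects
        if o.get("type") == "attack-pattern"
        and not o.get("revoked", False)
        and not o.get("x_mitre_deprecated", False)
    }
    used = {}
    for o in objects:
        if (o.get("type") == "relationship"
                and o.get("relationship_type") == "uses"
                and o.get("source_ref", "").startswith(source_type + "--")
                and o.get("target_ref") in techniques):
            used.setdefault(o["source_ref"], set()).add(o["target_ref"])
    return techniques, used


def coverage(bundle, source_type="campaign"):
    """Fraction of techniques used by at least one object of source_type."""
    techniques, used = technique_sets(bundle, source_type)
    if not techniques:
        return 0.0
    covered = set().union(*used.values())
    return len(covered) / len(techniques)


def jaccard(a, b):
    """Technique-overlap similarity between two technique sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def lcs_length(xs, ys):
    """Length of the longest common subsequence of two ordered sequences."""
    prev = [0] * (len(ys) + 1)
    for x in xs:
        cur = [0]
        for j, y in enumerate(ys, start=1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def _uses(src, dst):
    # Helper for the toy bundle only; real relationships also carry an id, timestamps, etc.
    return {"type": "relationship", "relationship_type": "uses",
            "source_ref": src, "target_ref": dst}


# Hypothetical toy bundle: real STIX ids are UUID-based, these are shortened for readability.
toy_bundle = {"objects": [
    {"type": "attack-pattern", "id": "attack-pattern--t1"},
    {"type": "attack-pattern", "id": "attack-pattern--t2"},
    {"type": "attack-pattern", "id": "attack-pattern--t3"},
    {"type": "attack-pattern", "id": "attack-pattern--t4"},
    _uses("campaign--c1", "attack-pattern--t1"),
    _uses("campaign--c1", "attack-pattern--t2"),
    _uses("campaign--c2", "attack-pattern--t2"),
    _uses("intrusion-set--g1", "attack-pattern--t1"),
    _uses("intrusion-set--g1", "attack-pattern--t2"),
    _uses("intrusion-set--g1", "attack-pattern--t3"),
]}

campaign_cov = coverage(toy_bundle, "campaign")
group_cov = coverage(toy_bundle, "intrusion-set")
_, per_campaign = technique_sets(toy_bundle, "campaign")
overlap = jaccard(per_campaign["campaign--c1"], per_campaign["campaign--c2"])
# Illustrative technique orderings, not taken from the paper.
shared = lcs_length(["T1566", "T1059", "T1003", "T1041"],
                    ["T1059", "T1021", "T1041"])
print(campaign_cov, group_cov, overlap, shared)
```

On this toy bundle, campaigns cover two of the four techniques while the intrusion set covers three, which mirrors the direction of the reported finding without reproducing its numbers. Running the same functions on the real ATT&CK Enterprise bundle would require loading the published JSON file into `bundle` first.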

CTI-HAL: A Human-Annotated Dataset for Cyber Threat Intelligence Analysis

Cryptography and Security

Helps computers understand online threats faster.

8 Apr 2025

87%

Enabling Transparent Cyber Threat Intelligence Combining Large Language Models and Domain Ontologies

Cryptography and Security

Helps computers find bad guys in computer logs.

26 Aug 2025

86%

KnowHow: Automatically Applying High-Level CTI Knowledge for Interpretable and Accurate Provenance Analysis

Cryptography and Security

Finds hidden computer attacks using smart language.

6 Sep 2025

Country of Origin

🇧🇷 Brazil

Page Count

14 pages
