On the Consistency of GNN Explanations for Malware Detection
By: Hossein Shokouhinejad , Griffin Higgins , Roozbeh Razavi-Far and more
Potential Business Impact:
Finds computer viruses by understanding code paths.
Control Flow Graphs (CFGs) are critical for analyzing program execution and characterizing malware behavior. With the growing adoption of Graph Neural Networks (GNNs), CFG-based representations have proven highly effective for malware detection. This study proposes a novel framework that dynamically constructs CFGs and embeds node features using a hybrid approach combining rule-based encoding and autoencoder-based embedding. A GNN-based classifier is then constructed to detect malicious behavior from the resulting graph representations. To improve model interpretability, we apply state-of-the-art explainability techniques, including GNNExplainer, PGExplainer, and CaptumExplainer, the latter is utilized three attribution methods: Integrated Gradients, Guided Backpropagation, and Saliency. In addition, we introduce a novel aggregation method, called RankFusion, that integrates the outputs of the top-performing explainers to enhance the explanation quality. We also evaluate explanations using two subgraph extraction strategies, including the proposed Greedy Edge-wise Composition (GEC) method for improved structural coherence. A comprehensive evaluation using accuracy, fidelity, and consistency metrics demonstrates the effectiveness of the proposed framework in terms of accurate identification of malware samples and generating reliable and interpretable explanations.
Similar Papers
Explainable Ensemble Learning for Graph-Based Malware Detection
Cryptography and Security
Finds computer viruses and explains why.
Explainable Attention-Guided Stacked Graph Neural Networks for Malware Detection
Cryptography and Security
Finds computer viruses and explains why.
A Research and Development Portfolio of GNN Centric Malware Detection, Explainability, and Dataset Curation
Cryptography and Security
Finds computer viruses faster and explains how.