BEACON: Behavioral Malware Classification with Large Language Model Embeddings and Deep Learning

Published: September 18, 2025 | arXiv ID: 2509.14519v1

By: Wadduwage Shanika Perera, Haodi Jiang

Potential Business Impact:

Finds hidden computer viruses by watching what they do.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Malware is becoming increasingly complex and widespread, making it essential to develop more effective and timely detection methods. Traditional static analysis often fails to defend against modern threats that employ code obfuscation, polymorphism, and other evasion techniques. In contrast, behavioral malware detection, which monitors runtime activities, provides a more reliable and context-aware solution. In this work, we propose BEACON, a novel deep learning framework that leverages large language models (LLMs) to generate dense, contextual embeddings from raw sandbox-generated behavior reports. These embeddings capture semantic and structural patterns of each sample and are processed by a one-dimensional convolutional neural network (1D CNN) for multi-class malware classification. Evaluated on the Avast-CTU Public CAPE Dataset, our framework consistently outperforms existing methods, highlighting the effectiveness of LLM-based behavioral embeddings and the overall design of BEACON for robust malware classification.
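The pipeline described in the abstract (behavior report, then embedding, then 1D CNN, then class probabilities) can be illustrated with a minimal sketch. This is not the authors' implementation. The `embed_report` function is a hypothetical hashing stand-in for an LLM embedding model, the weights are random rather than trained, and the sizes (`EMBED_DIM`, `NUM_FILTERS`, `KERNEL_SIZE`, `NUM_CLASSES`) and the sample report string are assumptions chosen only to keep the example small and dependency-free.

```python
import hashlib
import math
import random

EMBED_DIM = 32     # assumed; real LLM embeddings are much larger
NUM_CLASSES = 10   # assumed number of malware families
NUM_FILTERS = 4    # assumed number of convolution filters
KERNEL_SIZE = 3    # assumed convolution kernel width


def embed_report(report, dim=EMBED_DIM):
    """Hypothetical stand-in for an LLM embedding: hash tokens into a unit vector."""
    vec = [0.0] * dim
    for token in report.split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[digest[0] % dim] += 1.0 if digest[1] % 2 == 0 else -1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def conv1d_relu(signal, kernels):
    """Slide each kernel over the 1D signal (no padding), then apply ReLU."""
    feature_maps = []
    for kernel in kernels:
        k = len(kernel)
        feature_maps.append([
            max(0.0, sum(signal[i + j] * kernel[j] for j in range(k)))
            for i in range(len(signal) - k + 1)
        ])
    return feature_maps


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(report, kernels, weights, biases):
    """Embed a report, run conv + global max pooling, and return class probabilities."""
    embedding = embed_report(report)
    feature_maps = conv1d_relu(embedding, kernels)
    pooled = [max(fm) for fm in feature_maps]  # one value per filter
    logits = [
        sum(w * p for w, p in zip(row, pooled)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


# Untrained, randomly initialised parameters (illustration only).
rng = random.Random(0)
kernels = [[rng.uniform(-1, 1) for _ in range(KERNEL_SIZE)] for _ in range(NUM_FILTERS)]
weights = [[rng.uniform(-1, 1) for _ in range(NUM_FILTERS)] for _ in range(NUM_CLASSES)]
biases = [0.0] * NUM_CLASSES

# Made-up report text: a few Windows API call names as they might appear in a sandbox log.
sample_report = "CreateFileW RegSetValueExW connect WriteProcessMemory"
probs = classify(sample_report, kernels, weights, biases)
predicted_class = max(range(NUM_CLASSES), key=probs.__getitem__)
```

In a real system the hashing stub would be replaced by embeddings from an actual language model, and the convolution and output weights would be learned from labeled reports. The sketch only shows how the pieces connect.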

BEACON: A Unified Behavioral-Tactical Framework for Explainable Cybercrime Analysis with Large Language Models

Cryptography and Security

Finds scam tricks by understanding how people think.

6 Dec 2025 0

88%

MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)

Cryptography and Security

Explains what bad computer programs do.

1 Apr 2025 1

87%

MalCVE: Malware Detection and CVE Association Using Large Language Models

Cryptography and Security

Finds computer viruses and their weaknesses.

17 Oct 2025 1

Country of Origin

🇺🇸 United States

Page Count

8 pages
