Deep Learning-based Binary Analysis for Vulnerability Detection in x86-64 Machine Code
By: Mitchell Petingola
Potential Business Impact:
Finds computer bugs directly from code.
While much of the current research in deep learning-based vulnerability detection relies on disassembled binaries, this paper explores the feasibility of extracting features directly from raw x86-64 machine code. Although assembly language is more interpretable for humans, it requires more complex models to capture token-level context. In contrast, machine code may enable more efficient, lightweight models and preserve all information that might be lost in disassembly. This paper approaches the task of vulnerability detection through an exploratory study on two specific deep learning model architectures and aims to systematically evaluate their performance across three vulnerability types. The results demonstrate that graph-based models consistently outperform sequential models, emphasizing the importance of control flow relationships, and that machine code contains sufficient information for effective vulnerability discovery.
Similar Papers
Code Vulnerability Detection Across Different Programming Languages with AI Models
Cryptography and Security
Finds hidden bugs in computer code.
From Lab to Reality: A Practical Evaluation of Deep Learning Models and LLMs for Vulnerability Detection
Cryptography and Security
Finds hidden computer bugs in new code.
A Decompilation-Driven Framework for Malware Detection with Large Language Models
Cryptography and Security
Helps computers spot bad computer programs.