Score: 0

Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations

Published: August 28, 2025 | arXiv ID: 2509.05311v1

By: Konur Tholl , François Rivest , Mariam El Mezouar and more

Potential Business Impact:

Teaches computers to fight cyber threats faster.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Reinforcement Learning (RL) has shown great potential for autonomous decision-making in the cybersecurity domain, enabling agents to learn through direct environment interaction. However, RL agents in Autonomous Cyber Operations (ACO) typically learn from scratch, requiring them to execute undesirable actions to learn their consequences. In this study, we integrate external knowledge in the form of a Large Language Model (LLM) pretrained on cybersecurity data that our RL agent can directly leverage to make informed decisions. By guiding initial training with an LLM, we improve baseline performance and reduce the need for exploratory actions with obviously negative outcomes. We evaluate our LLM-integrated approach in a simulated cybersecurity environment, and demonstrate that our guided agent achieves over 2x higher rewards during early training and converges to a favorable policy approximately 4,500 episodes faster than the baseline.

Large Language Models are Autonomous Cyber Defenders

Artificial Intelligence

Helps AI teams fight computer attackers together.

7 May 2025 1

92%

Large Language Model-Based Reward Design for Deep Reinforcement Learning-Driven Autonomous Cyber Defense

Machine Learning (CS)

Teaches computers to defend against cyberattacks.

20 Nov 2025 0

92%

Enhancing Cloud Network Resilience via a Robust LLM-Empowered Multi-Agent Reinforcement Learning Framework

Cryptography and Security

AI guards computer clouds from hackers better.

12 Jan 2026 1

View PDF Login to Bookmark

Page Count

14 pages

Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations

Teaches computers to fight cyber threats faster.

Technical Abstract

Large Language Models are Autonomous Cyber Defenders

Large Language Model-Based Reward Design for Deep Reinforcement Learning-Driven Autonomous Cyber Defense

Enhancing Cloud Network Resilience via a Robust LLM-Empowered Multi-Agent Reinforcement Learning Framework