Score: 0

Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses

Published: October 23, 2025 | arXiv ID: 2510.20314v1

By: Wu Yichao , Wang Yirui , Ding Panpan and more

Potential Business Impact:

Makes smart machines safer from tricky attacks.

Business Areas:

Security Privacy and Security

With the wide application of deep reinforcement learning (DRL) techniques in complex fields such as autonomous driving, intelligent manufacturing, and smart healthcare, how to improve its security and robustness in dynamic and changeable environments has become a core issue in current research. Especially in the face of adversarial attacks, DRL may suffer serious performance degradation or even make potentially dangerous decisions, so it is crucial to ensure their stability in security-sensitive scenarios. In this paper, we first introduce the basic framework of DRL and analyze the main security challenges faced in complex and changing environments. In addition, this paper proposes an adversarial attack classification framework based on perturbation type and attack target and reviews the mainstream adversarial attack methods against DRL in detail, including various attack methods such as perturbation state space, action space, reward function and model space. To effectively counter the attacks, this paper systematically summarizes various current robustness training strategies, including adversarial training, competitive training, robust learning, adversarial detection, defense distillation and other related defense techniques, we also discuss the advantages and shortcomings of these methods in improving the robustness of DRL. Finally, this paper looks into the future research direction of DRL in adversarial environments, emphasizing the research needs in terms of improving generalization, reducing computational complexity, and enhancing scalability and explainability, aiming to provide valuable references and directions for researchers.

Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach

Machine Learning (CS)

Protects self-driving cars from tricky computer tricks.

22 Jul 2025 0

89%

Baiting AI: Deceptive Adversary Against AI-Protected Industrial Infrastructures

Cryptography and Security

Makes hackers secretly break water plants.

13 Jan 2026 1

89%

Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks

Machine Learning (CS)

Tricks robots into making bad choices.

26 Mar 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

23 pages

Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses

Makes smart machines safer from tricky attacks.

Technical Abstract

Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach

Baiting AI: Deceptive Adversary Against AI-Protected Industrial Infrastructures

Robust Deep Reinforcement Learning in Robotics via Adaptive Gradient-Masked Adversarial Attacks