Score: 1

AdaGAT: Adaptive Guidance Adversarial Training for the Robustness of Deep Neural Networks

Published: August 24, 2025 | arXiv ID: 2508.17265v1

By: Zhenyu Liu , Huizhi Liang , Xinrun Li and more

Potential Business Impact:

Makes small computer brains smarter and tougher.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Adversarial distillation (AD) is a knowledge distillation technique that facilitates the transfer of robustness from teacher deep neural network (DNN) models to lightweight target (student) DNN models, enabling the target models to perform better than only training the student model independently. Some previous works focus on using a small, learnable teacher (guide) model to improve the robustness of a student model. Since a learnable guide model starts learning from scratch, maintaining its optimal state for effective knowledge transfer during co-training is challenging. Therefore, we propose a novel Adaptive Guidance Adversarial Training (AdaGAT) method. Our method, AdaGAT, dynamically adjusts the training state of the guide model to install robustness to the target model. Specifically, we develop two separate loss functions as part of the AdaGAT method, allowing the guide model to participate more actively in backpropagation to achieve its optimal state. We evaluated our approach via extensive experiments on three datasets: CIFAR-10, CIFAR-100, and TinyImageNet, using the WideResNet-34-10 model as the target model. Our observations reveal that appropriately adjusting the guide model within a certain accuracy range enhances the target model's robustness across various adversarial attacks compared to a variety of baseline models.

Calibrated Adversarial Sampling: Multi-Armed Bandit-Guided Generalization Against Unforeseen Attacks

Machine Learning (CS)

Makes AI smarter and safer from tricks.

15 Nov 2025 0

88%

Adversarial Distilled Retrieval-Augmented Guarding Model for Online Malicious Intent Detection

Cryptography and Security

Stops bad online messages faster and better.

18 Sep 2025 0

88%

Graph-Attention Network with Adversarial Domain Alignment for Robust Cross-Domain Facial Expression Recognition

CV and Pattern Recognition

Helps computers recognize faces in different pictures.

29 Nov 2025 1

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

14 pages

AdaGAT: Adaptive Guidance Adversarial Training for the Robustness of Deep Neural Networks

Makes small computer brains smarter and tougher.

Technical Abstract

Calibrated Adversarial Sampling: Multi-Armed Bandit-Guided Generalization Against Unforeseen Attacks

Adversarial Distilled Retrieval-Augmented Guarding Model for Online Malicious Intent Detection

Graph-Attention Network with Adversarial Domain Alignment for Robust Cross-Domain Facial Expression Recognition