Evasion Attacks Against Bayesian Predictive Models
By: Pablo G. Arce, Roi Naveiro, David Ríos Insua
Potential Business Impact:
Makes smart programs harder to trick.
There is increasing interest in analyzing the behavior of machine learning systems under adversarial attacks. However, most research in adversarial machine learning has focused on weaknesses against evasion or poisoning attacks on predictive models in classical setups, while the susceptibility of Bayesian predictive models to attack remains underexplored. This paper introduces a general methodology for designing optimal evasion attacks against such models. We investigate two adversarial objectives: perturbing specific point predictions and altering the entire posterior predictive distribution. For both scenarios, we propose novel gradient-based attacks and study their implementation and properties in a variety of computational setups.
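To make the gradient-based setting concrete, the sketch below shows one way an evasion attack against a Bayesian predictive model could look when the posterior is approximated by a finite set of weight samples: the attacker runs projected gradient descent on the input to pull the Monte Carlo posterior predictive mean toward a chosen target value. This is an illustrative sketch, not the paper's method; the network architecture, the quadratic attacker objective, and all hyperparameters here are assumptions.

```python
import torch
import torch.nn as nn

def make_net():
    # Small illustrative regression network (architecture is an assumption).
    return nn.Sequential(nn.Linear(2, 32), nn.Tanh(), nn.Linear(32, 1))

# Stand-ins for posterior weight samples theta_1..theta_S (e.g. obtained via
# MCMC or variational inference); independently initialized networks play
# that role here purely for illustration.
posterior_samples = [make_net() for _ in range(10)]

def predictive_mean(x):
    # Monte Carlo estimate of the posterior predictive mean at input x.
    return torch.stack([net(x) for net in posterior_samples]).mean(dim=0)

def evasion_attack(x_orig, y_target, eps=0.3, step=0.05, n_iters=50):
    # Projected gradient descent on the attacker's loss: pull the predictive
    # mean toward y_target while keeping the perturbation inside an
    # L-infinity ball of radius eps around the original input.
    x_adv = x_orig.clone().detach().requires_grad_(True)
    for _ in range(n_iters):
        loss = (predictive_mean(x_adv) - y_target).pow(2).sum()
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv -= step * grad.sign()                            # signed-gradient step
            x_adv.clamp_(min=x_orig - eps, max=x_orig + eps)       # project back into the ball
    return x_adv.detach()

x = torch.tensor([[0.5, -1.0]])
x_adv = evasion_attack(x, y_target=torch.tensor([[3.0]]))
print("clean prediction:      ", predictive_mean(x).item())
print("adversarial prediction:", predictive_mean(x_adv).item())
```

The same loop could target the full posterior predictive distribution rather than a point prediction by replacing the squared-error objective with a discrepancy between the clean and perturbed predictive distributions, again estimated from the posterior samples.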
Similar Papers
Poisoning Bayesian Inference via Data Deletion and Replication
Machine Learning (Stat)
Makes AI believe false things by changing its data.
Detecting and Preventing Data Poisoning Attacks on AI Models
Cryptography and Security
Protects smart programs from bad data.
A unified Bayesian framework for adversarial robustness
Machine Learning (Stat)
Protects computer brains from sneaky tricks.