Abstract
We present a new algorithm to learn a deep neural network model robust against adversarial attacks. Previous algorithms demonstrate an adversarially trained Bayesian Neural Network (BNN) provides improved robustness. We recognize the adversarial learning approach for approximating the multi-modal posterior distribution of a Bayesian model can lead to mode collapse; consequently, the model's achievements in robustness and performance are sub-optimal. Instead, we first propose preventing mode collapse to better approximate the multi-modal posterior distribution. Second, based on the intuition that a robust model should ignore perturbations and only consider the informative content of the input, we conceptualize and formulate an information gain objective to measure and force the information learned from both benign and adversarial training instances to be similar. Importantly. we prove and demonstrate that minimizing the information gain objective allows the adversarial risk to approach the conventional empirical risk. We believe our efforts provide a step toward a basis for a principled method of adversarially training BNNs. Our model demonstrate significantly improved robustness-up to 20%-compared with adversarial training (Madry et al., 2018) and Adv-BNN (Liu et al., 2019) under PGD attacks with 0.035 distortion on both CIFAR-10 and STL-10 datasets.
Original language | English |
---|---|
Title of host publication | Proceedings of the 39th International Conference on Machine Learning 2022 |
Editors | Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, Sivan Sabato |
Place of Publication | London UK |
Publisher | Proceedings of Machine Learning Research (PMLR) |
Pages | 5309-5323 |
Number of pages | 15 |
Volume | 162 |
Publication status | Published - 2022 |
Externally published | Yes |
Event | International Conference on Machine Learning 2022 - Baltimore, United States of America Duration: 17 Jul 2022 → 23 Jul 2022 Conference number: 396th https://icml.cc/Conferences/2022 https://icml.cc/virtual/2022/index.html (Website) https://proceedings.mlr.press/v162/ (Proceedings) |
Conference
Conference | International Conference on Machine Learning 2022 |
---|---|
Abbreviated title | ICML 2022 |
Country/Territory | United States of America |
City | Baltimore |
Period | 17/07/22 → 23/07/22 |
Internet address |