Bayesian Learning with Information Gain Provably Bounds Risk for a Robust Adversarial Defense

Bao Gia Doan, Ehsan Abbasnejad, Javen Qinfeng Shi, Damith C. Ranashinghe

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

1 Citation (Scopus)

Abstract

We present a new algorithm to learn a deep neural network model robust against adversarial attacks. Previous algorithms demonstrate an adversarially trained Bayesian Neural Network (BNN) provides improved robustness. We recognize the adversarial learning approach for approximating the multi-modal posterior distribution of a Bayesian model can lead to mode collapse; consequently, the model's achievements in robustness and performance are sub-optimal. Instead, we first propose preventing mode collapse to better approximate the multi-modal posterior distribution. Second, based on the intuition that a robust model should ignore perturbations and only consider the informative content of the input, we conceptualize and formulate an information gain objective to measure and force the information learned from both benign and adversarial training instances to be similar. Importantly. we prove and demonstrate that minimizing the information gain objective allows the adversarial risk to approach the conventional empirical risk. We believe our efforts provide a step toward a basis for a principled method of adversarially training BNNs. Our model demonstrate significantly improved robustness-up to 20%-compared with adversarial training (Madry et al., 2018) and Adv-BNN (Liu et al., 2019) under PGD attacks with 0.035 distortion on both CIFAR-10 and STL-10 datasets.

Original languageEnglish
Title of host publicationProceedings of the 39th International Conference on Machine Learning 2022
EditorsKamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, Sivan Sabato
Place of PublicationLondon UK
PublisherProceedings of Machine Learning Research (PMLR)
Pages5309-5323
Number of pages15
Volume162
Publication statusPublished - 2022
Externally publishedYes
EventInternational Conference on Machine Learning 2022 - Baltimore, United States of America
Duration: 17 Jul 202223 Jul 2022
Conference number: 396th
https://icml.cc/Conferences/2022
https://icml.cc/virtual/2022/index.html (Website)
https://proceedings.mlr.press/v162/ (Proceedings)

Conference

ConferenceInternational Conference on Machine Learning 2022
Abbreviated titleICML 2022
Country/TerritoryUnited States of America
CityBaltimore
Period17/07/2223/07/22
Internet address

Cite this