Abstract
Ensemble-based adversarial training is a principled approach to achieve robustness against adversarial attacks. An important technique of this approach is to control the transferability of adversarial examples among ensemble members. We propose in this work a simple yet effective strategy to collaborate among committee models of an ensemble model. This is achieved via the secure and insecure sets defined for each model member on a given sample, hence help us to quantify and regularize the transferability. Consequently, our proposed framework provides the flexibility to reduce the adversarial transferability as well as to promote the diversity of ensemble members, which are two crucial factors for better robustness in our ensemble approach. We conduct extensive and comprehensive experiments to demonstrate that our proposed method outperforms the state-of-the-art ensemble baselines, at the same time can detect a wide range of adversarial examples with a nearly perfect accuracy. Our code is available at: https://github.com/tuananhbui89/Crossing-CollaborativeEnsemble.
Original language | English |
---|---|
Title of host publication | Proceedings of the AAAI Conference on Artificial Intelligence, AAAI-21 |
Place of Publication | Palo Alto CA USA |
Publisher | Association for the Advancement of Artificial Intelligence (AAAI) |
Pages | 6831-6839 |
Number of pages | 9 |
ISBN (Electronic) | 9781713835974 |
Publication status | Published - 2021 |
Event | AAAI Conference on Artificial Intelligence 2021 - Online, United States of America Duration: 2 Feb 2021 → 9 Feb 2021 Conference number: 35th https://aaai.org/Conferences/AAAI-21/ (Website) https://ojs.aaai.org/index.php/AAAI/issue/view/395 (Proceedings) |
Publication series
Name | 35th AAAI Conference on Artificial Intelligence, AAAI 2021 |
---|---|
Publisher | Association for the Advancement of Artificial Intelligence (AAAI) |
Volume | 8A |
ISSN (Print) | 2159-5399 |
ISSN (Electronic) | 2374-3468 |
Conference
Conference | AAAI Conference on Artificial Intelligence 2021 |
---|---|
Abbreviated title | AAAI 2021 |
Country/Territory | United States of America |
Period | 2/02/21 → 9/02/21 |
Internet address |
|
Keywords
- Adversarial Learning & Robustness
- Adversarial Attacks & Robustness
- Safety
- Robustness & Trustworthiness