Projects per year
Abstract
Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with lower sharpness have better generalization ability. However, existing posterior inferences are not aware of sharpness/flatness in terms of formulation, possibly leading to high sharpness for the models sampled from them. In this paper, we develop theories, the Bayesian setting, and the variational inference approach for the sharpness-aware posterior. Specifically, the models sampled from our sharpness-aware posterior, and the optimal approximate posterior estimating this sharpness-aware posterior, have better flatness, hence possibly possessing higher generalization ability. We conduct experiments by leveraging the sharpness-aware posterior with state-of-the-art Bayesian Neural Networks, showing that the flat-seeking counterparts outperform their baselines in all metrics of interest.
Original language | English |
---|---|
Title of host publication | Advances in Neural Information Processing Systems 36 (NeurIPS 2023) |
Editors | A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, S. Levine |
Place of Publication | San Diego CA USA |
Publisher | Neural Information Processing Systems (NIPS) |
Number of pages | 14 |
Publication status | Published - 2023 |
Event | Advances in Neural Information Processing Systems 2023 - Ernest N. Morial Convention Center, New Orleans, United States of America Duration: 10 Dec 2023 → 16 Dec 2023 Conference number: 37th https://openreview.net/group?id=NeurIPS.cc/2023/Conference#tab-accept-oral https://neurips.cc/ (Website) https://papers.nips.cc/paper_files/paper/2023 (Proceedings) |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Publisher | Neural Information Processing Systems (NIPS) |
Volume | 36 |
ISSN (Print) | 1049-5258 |
Conference
Conference | Advances in Neural Information Processing Systems 2023 |
---|---|
Abbreviated title | NeurIPS 2023 |
Country/Territory | United States of America |
City | New Orleans |
Period | 10/12/23 → 16/12/23 |
Internet address |
Projects
- 1 Active
-
Exploiting Geometries of Learning for Fast, Adaptive and Robust AI
Phung, D. (Primary Chief Investigator (PCI)), Tafazzoli Harandi, M. (Chief Investigator (CI)), Hartley, R. I. (Chief Investigator (CI)), Le, T. (Chief Investigator (CI)) & Koniusz, P. (Partner Investigator (PI))
ARC - Australian Research Council
8/05/23 → 7/05/26
Project: Research