Abstract
Flatness of the loss curve is conjectured to be connected to the generalization ability of machine learning models, in particular neural networks. While it has been empirically observed that flatness measures consistently correlate strongly with generalization, it is still an open theoretical problem why and under which circumstances flatness is connected to generalization, in particular in light of reparameterizations that change certain flatness measures but leave generalization unchanged. We investigate the connection between flatness and generalization by relating it to the interpolation from representative data, deriving notions of representativeness, and feature robustness. The notions allow us to rigorously connect flatness and generalization and to identify conditions under which the connection holds. Moreover, they give rise to a novel, but natural relative flatness measure that correlates strongly with generalization, simplifies to ridge regression for ordinary least squares, and solves the reparameterization issue.
Original language | English |
---|---|
Title of host publication | Advances in Neural Information Processing Systems 34 (NeurIPS 2021) |
Editors | Marc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan |
Place of Publication | San Diego CA USA |
Publisher | Neural Information Processing Systems (NIPS) |
Pages | 18420-18432 |
Number of pages | 13 |
ISBN (Electronic) | 9781713845393 |
Publication status | Published - 2021 |
Event | Advances in Neural Information Processing Systems 2021, Online, United States of America; Duration: 7 Dec 2021 → 10 Dec 2021; Conference number: 35th; Proceedings: https://papers.nips.cc/paper/2021; Website: https://nips.cc/Conferences/2021 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Publisher | Neural Information Processing Systems (NIPS) |
Volume | 22 |
ISSN (Print) | 1049-5258 |
Conference
Conference | Advances in Neural Information Processing Systems 2021 |
---|---|
Abbreviated title | NeurIPS 2021 |
Country/Territory | United States of America |
City | Online |
Period | 7/12/21 → 10/12/21 |
Projects
Rethinking the Data-driven Discovery of Rare Phenomena (finished)
Boley, M. (Primary Chief Investigator (PCI)), Buntine, W. (Partner Investigator (PI)), Schmidt, D. (Chief Investigator (CI)), Kuhlmann, L. (Chief Investigator (CI)) & Scheffler, M. (Partner Investigator (PI))
29/07/21 → 28/11/24
Project: Research