Relative flatness and generalization

Henning Petzka, Michael Kamp, Linara Adilova, Cristian Sminchisescu, Mario Boley

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

44 Citations (Scopus)

Abstract

Flatness of the loss curve is conjectured to be connected to the generalization ability of machine learning models, in particular neural networks. While it has been empirically observed that flatness measures consistently correlate strongly with generalization, it is still an open theoretical problem why and under which circumstances flatness is connected to generalization, in particular in light of reparameterizations that change certain flatness measures but leave generalization unchanged. We investigate the connection between flatness and generalization by relating it to the interpolation from representative data, deriving notions of representativeness, and feature robustness. The notions allow us to rigorously connect flatness and generalization and to identify conditions under which the connection holds. Moreover, they give rise to a novel, but natural relative flatness measure that correlates strongly with generalization, simplifies to ridge regression for ordinary least squares, and solves the reparameterization issue.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 34 (NeurIPS 2021)
EditorsMarc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
Place of PublicationSan Diego CA USA
PublisherNeural Information Processing Systems (NIPS)
Pages18420-18432
Number of pages13
ISBN (Electronic)9781713845393
Publication statusPublished - 2021
EventAdvances in Neural Information Processing Systems 2021 - Online, United States of America
Duration: 7 Dec 202110 Dec 2021
Conference number: 35th
https://papers.nips.cc/paper/2021 (Proceedings)
https://nips.cc/Conferences/2021 (Website)

Publication series

NameAdvances in Neural Information Processing Systems
PublisherNeural Information Processing Systems (NIPS)
Volume22
ISSN (Print)1049-5258

Conference

ConferenceAdvances in Neural Information Processing Systems 2021
Abbreviated titleNeurIPS 2021
Country/TerritoryUnited States of America
CityOnline
Period7/12/2110/12/21
Internet address

Cite this