A shooting formulation of deep learning

François Xavier Vialard, Roland Kwitt, Susan Wei, Marc Niethammer

Research output: Chapter in Book/Report/Conference proceedingConference PaperOtherpeer-review

5 Citations (Scopus)

Abstract

A residual network may be regarded as a discretization of an ordinary differential equation (ODE) which, in the limit of time discretization, defines a continuous-depth network. Although important steps have been taken to realize the advantages of such continuous formulations, most current techniques assume identical layers. Indeed, existing works throw into relief the myriad difficulties of learning an infinite-dimensional parameter in a continuous-depth neural network. To this end, we introduce a shooting formulation which shifts the perspective from parameterizing a network layer-by-layer to parameterizing over optimal networks described only by a set of initial conditions. For scalability, we propose a novel particle-ensemble parameterization which fully specifies the optimal weight trajectory of the continuous-depth neural network. Our experiments show that our particle-ensemble shooting formulation can achieve competitive performance. Finally, though the current work is inspired by continuous-depth neural networks, the particle-ensemble shooting formulation also applies to discrete-time networks and may lead to a new fertile area of research in deep learning parameterization.

Original languageEnglish
Title of host publicationNIPS '20: Proceedings of the 34th International Conference on Neural Information Processing Systems
EditorsH. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, H. Lin
Place of PublicationUSA
PublisherAssociation for Computing Machinery (ACM)
Pages1-11
Number of pages11
ISBN (Print)9781713829546
Publication statusPublished - 2020
Externally publishedYes
EventAdvances in Neural Information Processing Systems 2020 - Virtual, Online, United States of America
Duration: 6 Dec 202012 Dec 2020
Conference number: 34th
https://proceedings.neurips.cc/paper/2020 (Proceedings )
https://nips.cc/Conferences/2020 (Website)

Publication series

NameAdvances in Neural Information Processing Systems
ISSN (Print)1049-5258

Conference

ConferenceAdvances in Neural Information Processing Systems 2020
Abbreviated titleNeurIPS 2020
Country/TerritoryUnited States of America
CityVirtual, Online
Period6/12/2012/12/20
Internet address

Cite this