On the training of infinite depth and width Residual Networks

François-Xavier Vialard (U Gustave Eiffel, Paris)

Feb 11. 2025, 14:00 — 14:50

In this talk, we explain why the training of Residual Networks (ResNets) is relatively easy in two limiting cases. The first one is infinite depth and linear parametrization of the residual blocks. The second is infinite depth and infinite width.
For the second case, we introduce the conditional Wasserstein distance which naturally appears as the metric structure to train this limiting model, which encompasses the infinite depth and finite width setting. The main technical result is to prove a local Polyak-Lojasiewicz inequality in the first case and the existence of the flow in the second case.

Further Information

Venue:: ESI Boltzmann Lecture Hall
Associated Event:: Infinite-dimensional Geometry: Theory and Applications (Thematic Programme)
Organizer(s):: Tomasz Goliński (U of Białystok)
Gabriel Larotonda (U of Buenos Aires)
Alice Barbara Tumpach (WPI, Vienna)
Cornelia Vizman (WU of Timisoara)