Colloquium: Nathan Srebro (TTIC) | Department of Mathematics

3:00–4:00 pm Eckhart Hall, Room 202

Understanding Deep Learning Through Optimization Geometry

Abstract:

How can models with more parameters than training examples generalize well, and generalize even better when we add even more parameters? In recent years, it is becoming increasingly clear that such generalization ability comes from the optimization bias, or implicit bias, of the training procedures. In this talk, I will survey our work from the past several years on highlighting the role of optimization geometry in determining such implicit bias, and understanding deep learning through it.

Event Type

Nov 29