101 - The lottery ticket hypothesis, with Jonathan Frankle

Published: Jan. 14, 2020, 5:52 p.m.

In this episode, Jonathan Frankle describes the lottery ticket hypothesis, a popular explanation of how over-parameterization helps in training neural networks. We discuss pruning methods used to uncover subnetworks (winning tickets) which were initialized in a particularly effective way. We also discuss patterns observed in pruned networks, stability of networks pruned at different time steps and transferring uncovered subnetworks across tasks, among other topics.

A recent paper on the topic by Frankle and Carbin, ICLR 2019: https://arxiv.org/abs/1803.03635

Jonathan Frankle’s homepage: http://www.jfrankle.com/