Edward Milsom Profile picture
Lecturer (Assistant Professor) at University of Bath working on the foundations of deep learning
Feb 25 12 tweets 4 min read
Our paper "Function-Space Learning Rates" is on arXiv! We give an efficient way to estimate the magnitude of changes to NN outputs caused by a particular weight update. We analyse optimiser dynamics in function space, and enable hyperparameter transfer with our scheme FLeRM! 🧵👇 Image NN training learns input-output functions. But we typically understand optimisers as acting in parameter space. For example, traditional learning rates tell us how much parameters change each step, rather than the effect on the learned function.