On the SDEs and Scaling Rules for Adaptive Gradient Algorithms
Sadhika Malladi*, Kaifeng Lyu*, Abhishek Panigrahi, Sanjeev Arora
Published at: Neural Information Processing Systems (NeurIPS), 2022
[paper]
Oral presentation (270/3000 submissions ≈ 9% acceptance rate).