(S)GD over Diagonal Linear Networks: Implicit bias, Large Stepsizes and Edge of Stability
2023 · NeurIPS