b

DiscoverModelsSearch
About
Sparse maximal update parameterization: A holistic approach to sparse training dynamics
1 week ago
·
NeurIPS