b
Discover
Models
Search
About
μ
P
2
\boldsymbol{\mu}\mathbf{P^2}
μ
P
2
: Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling
4 weeks ago
·
NeurIPS