bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Hyperparameter Transfer Enables Consistent Gains of Matrix-Preconditioned Optimizers Across Scales | Read Paper on Bytez