bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling | Read Paper on Bytez