Train longer, generalize better: closing the generalization gap in large batch training of neural networks