On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima