bytez
Search
Feed
Models
Agent
Devs
Model API
docs
Can speed up the convergence rate of stochastic gradient methods to $\mathcal{O}(1/k^2)$ by a gradient averaging strategy? | Read Paper on Bytez