Iterate averaging as regularization for stochastic gradient descent
2018 · arXiv