Variance Reduction for Distributed Stochastic Gradient Descent | Read Paper on Bytez