Variance Reduction in SGD by Distributed Importance Sampling | Read Paper on Bytez