Train faster, generalize better: Stability of stochastic gradient descent | Read Paper on Bytez