A Robust Adaptive Stochastic Gradient Method for Deep Learning | Read Paper on Bytez