Diagonal Rescaling For Neural Networks | Read Paper on Bytez