Phase diagram of early training dynamics in deep neural networks: effect of the learning rate, depth, and width