On the interplay of network structure and gradient convergence in deep learning