Rethinking Gauss-Newton for learning over-parameterized models | Read Paper on Bytez