Deep Grokking: Would Deep Neural Networks Generalize Better?