Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability