Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks