On the Provable Separation of Scales in Maximal Update Parameterization | Read Paper on Bytez