Scaling Laws for Mixture Pretraining Under Data Constraints | Read Paper on Bytez