Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework