Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining