Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging