Scalable Model Merging with Progressive Layer-wise Distillation | Read Paper on Bytez