Asymmetric Masked Distillation for Pre-Training Small Foundation Models