DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning