LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing