Marco-MoE: Open Multilingual Mixture-of-Expert Language Models with Efficient Upcycling