Mixture of Nested Experts: Adaptive Processing of Visual Tokens