Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Read Paper on Bytez