Not All Tokens Are What You Need for Pretraining