The MiniPile Challenge for Data-Efficient Language Models