ChemPile: A 250 GB Diverse and Curated Dataset for Chemical Foundation Models | Read Paper on Bytez