b
Discover
Models
Search
About
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
2023
·
NeurIPS