Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models
1 week ago · NeurIPS