b
Discover
Models
Search
About
Data-parallel distributed training of very large models beyond GPU capacity
2018
·
arXiv