b
Discover
Models
Search
About
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
2 weeks ago
·
NeurIPS