b

DiscoverModelsSearch
About
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training
2 weeks ago
·
NeurIPS