To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability
6 months ago · arXiv