Towards Fully FP8 GEMM LLM Training at Scale