MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization | Read Paper on Bytez