Multiplication-Free Transformer Training via Piecewise Affine Operations | Read Paper on Bytez