Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Read Paper on Bytez