Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion | Read Paper on Bytez