SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction | Read Paper on Bytez