Instruction-based Image Manipulation by Watching How Things Move | Read Paper on Bytez