Video-Guided Foley Sound Generation with Multimodal Controls | Read Paper on Bytez