Efficient Multimodal Fusion via Interactive Prompting