Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Read Paper on Bytez