CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning