Object-centric binding in Contrastive Language-Image Pretraining | Read Paper on Bytez