CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Read Paper on Bytez