Whitened CLIP as a Likelihood Surrogate of Images and Captions