Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models | Read Paper on Bytez