Text-Image Alignment for Diffusion-Based Perception | Read Paper on Bytez