Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents | Read Paper on Bytez