Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis