Generating compositional scenes via Text-to-image RGBA Instance Generation | Read Paper on Bytez