MONet: Unsupervised Scene Decomposition and Representation | Read Paper on Bytez