CLIPPO: Image-and-Language Understanding From Pixels Only | Read Paper on Bytez