Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models | Read Paper on Bytez