Classification Done Right for Vision-Language Pre-Training | Read Paper on Bytez