Does Visual Pretraining Help End-to-End Reasoning? | Read Paper on Bytez