Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models | Read Paper on Bytez