Large Language Models are Visual Reasoning Coordinators