Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data | Read Paper on Bytez