Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs | Read Paper on Bytez