Group-in-Group Policy Optimization for LLM Agent Training