Order Matters: Agent-by-agent Policy Optimization