Towards Flash Thinking via Decoupled Advantage Policy Optimization | Read Paper on Bytez