Why Tree-Style Branching Matters for Thought Advantage Estimation in GRPO | Read Paper on Bytez