F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking