bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Read Paper on Bytez