bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning | Read Paper on Bytez