SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training