bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs | Read Paper on Bytez