ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization