bytez
Search

Feed
Models
Agent

Devs

API Dashboard
docs
GitHub

Direct Preference-based Policy Optimization without Reward Modeling
2023
·
NeurIPS