b
Discover
Models
Search
About
SimPO: Simple Preference Optimization with a Reference-Free Reward
7 months ago
·
arXiv