b
Discover
Models
Search
About
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
7 months ago
·
arXiv