b
Discover
Models
Search
About
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
2 weeks ago
·
NeurIPS