Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
2023 · NeurIPS