RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
5 months ago · CVPR