bytez
Search
Feed
Models
Agent
Devs
Plan
docs
PRIMT: Preference-based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models | Read Paper on Bytez