bytez
Search
Feed
Models
Agent
Devs
Plan
docs
R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models | Read Paper on Bytez