Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
7 days ago · arXiv