Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^\pi$-Realizability and Concentrability
1 week ago · NeurIPS