bytez
Search
Feed
Models
Agent
Devs
Model API
docs
Concentration bounds for temporal difference learning with linear function approximation: The case of batch data and uniform sampling | Read Paper on Bytez