b
Discover
Models
Search
About
Concentration bounds for temporal difference learning with linear function approximation: The case of batch data and uniform sampling
2013
·
arXiv