Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrt{T})$ Regret
7 months ago · arXiv