b

DiscoverModelsSearch
About
Approximate Thompson Sampling for Learning Linear Quadratic Regulators with O(T)O(\sqrt{T}) Regret
7 months ago
·
arXiv