bytez
Search
Feed
Models
Agent
Devs
Model API
docs
Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrt{T})$ Regret | Read Paper on Bytez