Loading paper
Thompson Sampling Achieves $\tilde O(\sqrt{T})$ Regret in Linear Quadratic Control | Tomesphere