Pessimism for Offline Linear Contextual Bandits using $\ell_p$ Confidence Sets