Satisficing Regret Minimization in Bandits: Constant Rate and Light-Tailed Distribution