Loading paper
Wasserstein Formulation of Reinforcement Learning. An Optimal Transport Perspective on Policy Optimization | Tomesphere