Loading paper
Reward prediction for representation learning and reward shaping | Tomesphere