Loading paper
RLSR: Reinforcement Learning from Self Reward | Tomesphere