Loading paper
Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models | Tomesphere