Loading paper
OPSD Compresses What RLVR Teaches: A Post-RL Compaction Stage for Reasoning Models | Tomesphere