Loading paper
Learning to Reason: Training LLMs with GPT-OSS or DeepSeek R1 Reasoning Traces | Tomesphere