Loading paper
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs | Tomesphere