Greedy Sampling Is Provably Efficient for RLHF