s3: You Don't Need That Much Data to Train a Search Agent via RL