Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment