Loading paper
Less is more: Not all samples are effective for evaluation | Tomesphere