CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming

Han Deng; Yuan Meng; Shixiang Tang; Wanli Ouyang; Xinzhu Ma

arXiv:2505.12925·cs.SE·October 28, 2025

CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming

Han Deng, Yuan Meng, Shixiang Tang, Wanli Ouyang, Xinzhu Ma

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces CPRet, a comprehensive dataset and benchmark suite for retrieval tasks in competitive programming, addressing issues of problem duplication and evaluation fairness, and proposing specialized retrieval models.

Contribution

The paper presents CPRet, a new dataset and benchmark for problem retrieval, along with two specialized models, improving evaluation and addressing duplication concerns in competitive programming.

Findings

01

High-quality data and test sets enable reliable evaluation.

02

Models achieve strong results on retrieval tasks.

03

Similarity-aware evaluation reveals inflated pass rates.

Abstract

Competitive programming benchmarks are widely used in scenarios such as programming contests and large language model assessments. However, the growing presence of duplicate or highly similar problems raises concerns not only about competition fairness, but also about the validity of competitive programming as a benchmark for model evaluation. In this paper, we propose a new problem, similar question retrieval, to tackle this issue. Due to the lack of both data and models, solving this problem is challenging. To this end, we introduce CPRet, a retrieval-oriented benchmark suite for competitive programming, covering four retrieval tasks: two code-centric (i.e., Text-to-Code, Code-to-Code) and two newly proposed problem-centric tasks (i.e., Problem-to-Duplicate, Simplified-to-Full) built from a combination of automatically crawled problem-solution data and manually curated annotations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

coldchair/cpret
jaxOfficial

Datasets

coldchair16/CPRet-data
dataset· 97 dl
97 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Mobile Crowdsensing and Crowdsourcing