CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain, Ho-Hsiang Wu, Tiferet Gazit, Miltiadis Allamanis, Marc Brockschmidt

TL;DR
This paper introduces the CodeSearchNet Challenge, a large-scale benchmark for evaluating semantic code search methods across multiple programming languages, aiming to advance research in bridging natural language and code retrieval.
Contribution
It provides a comprehensive dataset, evaluation methodology, and baseline solutions for semantic code search, fostering progress through a public challenge and leaderboard.
Findings
Created a dataset with 6 million functions across 6 languages
Annotated 99 natural language queries with expert relevance labels
Launched a benchmark and baseline solutions for the task
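One common way to score a ranker against graded relevance labels like these is normalized discounted cumulative gain (NDCG). The sketch below is illustrative only: the 0 to 3 relevance scale, the function names, and the sample labels are assumptions and are not taken from this page.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain for relevance labels given in ranked order."""
    # Rank 0 is the top result; the log2 discount grows with rank.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances))


def ndcg(ranked_relevances):
    """DCG of the given ranking divided by the DCG of the ideal ordering."""
    ideal = dcg(sorted(ranked_relevances, reverse=True))
    if ideal == 0:
        # No relevant results at all: define the score as 0.
        return 0.0
    return dcg(ranked_relevances) / ideal


# Hypothetical labels (0 = irrelevant, 3 = exact match) for one query,
# listed in the order a search system returned the results.
system_ranking = [1, 3, 0, 2]
score = ndcg(system_ranking)
```

A score of 1.0 means the system returned results in the best possible order for that query; averaging over all queries gives a single benchmark number.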
Abstract
Semantic code search is the task of retrieving relevant code given a natural language query. While related to other information retrieval tasks, it requires bridging the gap between the language used in code (often abbreviated and highly technical) and natural language more suitable to describe vague concepts and ideas. To enable evaluation of progress on code search, we are releasing the CodeSearchNet Corpus and are presenting the CodeSearchNet Challenge, which consists of 99 natural language queries with about 4k expert relevance annotations of likely results from CodeSearchNet Corpus. The corpus contains about 6 million functions from open-source code spanning six programming languages (Go, Java, JavaScript, PHP, Python, and Ruby). The CodeSearchNet Corpus also contains automatically generated query-like natural language for 2 million functions, obtained from mechanically scraping…
Code & Models
- 🤗 Salesforce/codet5-base-multi-sum (model) · 720 dl · ♡ 32
- 🤗 Salesforce/codet5-base (model) · 15k dl · ♡ 132
- 🤗 Salesforce/codet5-small (model) · 14k dl · ♡ 81
- 🤗 SaulLu/cotet5_small_fix (model) · 32 dl · ♡ 1
- 🤗 huggingface/CodeBERTa-language-id (model) · 639 dl · ♡ 68
- 🤗 huggingface/CodeBERTa-small-v1 (model) · 29k dl · ♡ 88
- 🤗 mrm8488/CodeBERTaPy (model) · 37 dl · ♡ 4
- 🤗 mrm8488/codeBERTaJS (model) · 39 dl · ♡ 6
- 🤗 Salesforce/codet5-large (model) · 2.0k dl · ♡ 74
- 🤗 Salesforce/codet5-large-ntp-py (model) · 475 dl · ♡ 27
Taxonomy
Topics: Software Engineering Research · Software Testing and Debugging Techniques · Software Reliability and Analysis Research
