CursorCore: Assist Programming through Aligning Anything
Hao Jiang, Qi Liu, Rui Li, Shengyu Ye, Shijin Wang

TL;DR
CursorCore introduces a comprehensive conversational framework for programming assistance that integrates multiple information sources, along with a new benchmark and a large dataset, to improve code-related tasks.
Contribution
The paper presents a novel framework, benchmark, and data pipeline for enhancing programming assistance models through better information alignment and automation.
Findings
CursorCore outperforms comparable models in programming tasks.
The new benchmark APEval effectively assesses model alignment and output quality.
Generated 219K training samples to fine-tune models and demonstrate improvements.
Abstract
Large language models have been successfully applied to programming assistance tasks, such as code completion, code insertion, and instructional code editing. However, these applications remain insufficiently automated and struggle to effectively integrate various types of information during the programming process, including coding history, current code, and user instructions. In this work, we propose a new conversational framework that comprehensively integrates these information sources, collect data to train our models and evaluate their performance. Firstly, to thoroughly evaluate how well models align with different types of information and the quality of their outputs, we introduce a new benchmark, APEval (Assist Programming Eval), to comprehensively assess the performance of models in programming assistance tasks. Then, for data collection, we develop a data generation pipeline,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗TechxGenus/CursorCore-Yi-1.5Bmodel· 15 dl· ♡ 115 dl♡ 1
- 🤗TechxGenus/CursorCore-DS-1.3Bmodel· 9 dl9 dl
- 🤗TechxGenus/CursorCore-QW2.5-1.5Bmodel· 251 dl· ♡ 1251 dl♡ 1
- 🤗TechxGenus/CursorCore-Yi-9Bmodel· 9 dl· ♡ 19 dl♡ 1
- 🤗TechxGenus/CursorCore-DS-6.7Bmodel· 9 dl9 dl
- 🤗TechxGenus/CursorCore-QW2.5-7Bmodel· 8 dl· ♡ 88 dl♡ 8
- 🤗TechxGenus/CursorCore-Yi-1.5B-LCmodel· 3 dl3 dl
- 🤗TechxGenus/CursorCore-QW2.5-1.5B-LCmodel· 5 dl5 dl
- 🤗TechxGenus/CursorCore-DS-1.3B-AWQmodel· 3 dl3 dl
- 🤗TechxGenus/CursorCore-Yi-1.5B-AWQmodel· 1 dl1 dl
Videos
Taxonomy
TopicsTeaching and Learning Programming
MethodsALIGN
