# ROSF: Leveraging Information Retrieval and Supervised Learning for   Recommending Code Snippets

**Authors:** He Jiang, Liming Nie, Zeyi Sun, Zhilei Ren, Weiqiang Kong, Tao Zhang,, Xiapu Luo

arXiv: 1703.01446 · 2017-03-07

## TL;DR

ROSF is a novel approach that combines information retrieval and supervised learning to improve code snippet recommendations, significantly outperforming existing methods in large-scale experiments.

## Contribution

This paper introduces ROSF, a two-stage method integrating IR and supervised learning for more accurate code snippet recommendations.

## Key findings

- ROSF outperforms state-of-the-art methods by 20%-41% in Precision.
- ROSF improves NDCG scores by 13%-33%.
- Effective in large-scale real-world code corpus.

## Abstract

When implementing unfamiliar programming tasks, developers commonly search code examples and learn usage patterns of APIs from the code examples or reuse them by copy-pasting and modifying. For providing high-quality code examples, previous studies present several methods to recommend code snippets mainly based on information retrieval. In this paper, to provide better recommendation results, we propose ROSF, Recommending cOde Snippets with multi-aspect Features, a novel method combining both information retrieval and supervised learning. In our method, we recommend Top-Kcode snippets for a givenfree-form query based on two stages, i.e., coarse-grained searching and fine-grained re-ranking. First, we generate a code snippet candidate set by searching a code snippet corpus using an information retrieval method. Second, we predict probability values of the code snippets for different relevance scores in the candidate set by the learned prediction model from a training set, re-rank these candidate code snippets according to the probability values, and recommend the final results to developers. We conduct several experiments to evaluate our method in a large-scale corpus containing 921,713 real-world code snippets. The results show that ROSF is an effective method for code snippets recommendation and outperforms the-state-of-the-art methods by 20%-41% in Precision and 13%-33% in NDCG

---
Source: https://tomesphere.com/paper/1703.01446