# Crowd Sourced Data Analysis: Mapping of Programming Concepts to   Syntactical Patterns

**Authors:** Deepak Thukral, Darvesh Punia

arXiv: 1903.12495 · 2019-04-01

## TL;DR

This paper presents a method to improve code search by mapping programming concepts to natural language keywords using Stack Overflow data, enabling more intuitive natural language-based code retrieval.

## Contribution

It introduces a novel approach to associate programming concepts with natural language terms and tags code lines accordingly, enhancing code search capabilities.

## Key findings

- Improved accuracy in natural language code search.
- Effective mapping of concepts to keywords using Stack Overflow data.
- Enhanced developer productivity in locating code snippets.

## Abstract

Since programming concepts do not match their syntactic representations, code search is a very tedious task. For instance in Java or C, array doesn't match [], so using "array" as a query, one cannot find what they are looking for. Often developers have to search code whether to understand any code, or to reuse some part of that code, or just to read it, without natural language searching, developers have to often scroll back and forth or use variable names as their queries. In our work, we have used Stackoverflow (SO) question and answers to make a mapping of programming concepts with their respective natural language keywords, and then tag these natural language terms to every line of code, which can further we used in searching using natural language keywords.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.12495/full.md

---
Source: https://tomesphere.com/paper/1903.12495