TL;DR
This study investigates how effectively developers use general-purpose search engines like Google for code-related searches, revealing that code searches tend to require more effort and are less optimized compared to non-code searches.
Contribution
The paper introduces a novel classifier for distinguishing code-related from non-code-related search queries and analyzes a large dataset to evaluate search effort differences.
Findings
Code searches require more effort than non-code searches.
Google's search performance is less effective for code-related queries.
A high-accuracy classifier was developed to identify code intent in queries.
Abstract
Search is an integral part of a software development process. Developers often use search engines to look for information during development, including reusable code snippets, API understanding, and reference examples. Developers tend to prefer general-purpose search engines like Google, which are often not optimized for code related documents and use search strategies and ranking techniques that are more optimized for generic, non-code related information. In this paper, we explore whether a general purpose search engine like Google is an optimal choice for code-related searches. In particular, we investigate whether the performance of searching with Google varies for code vs. non-code related searches. To analyze this, we collect search logs from 310 developers that contains nearly 150,000 search queries from Google and the associated result clicks. To differentiate between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
