ARCLIN: Automated API Mention Resolution for Unformatted Texts
Yintong Huo, Yuxin Su, Hongming Zhang, Michael R. Lyu

TL;DR
This paper introduces ARCLIN, an automated tool that accurately recognizes and links API mentions in informal online forum texts, overcoming challenges like abbreviation, ambiguity, and lack of formal naming.
Contribution
ARCLIN employs a CRF and Bi-LSTM model for API recognition and a context-aware scoring for linking, achieving superior performance without manual annotations.
Findings
ARCLIN outperforms heuristic-based methods by 8% on Py-mention dataset.
It effectively distinguishes API mentions from common words in informal texts.
The tool accurately links API mentions to repository entries without manual supervision.
Abstract
Online technical forums (e.g., StackOverflow) are popular platforms for developers to discuss technical problems such as how to use specific Application Programming Interface (API), how to solve the programming tasks, or how to fix bugs in their codes. These discussions can often provide auxiliary knowledge of how to use the software that is not covered by the official documents. The automatic extraction of such knowledge will support a set of downstream tasks like API searching or indexing. However, unlike official documentation written by experts, discussions in open forums are made by regular developers who write in short and informal texts, including spelling errors or abbreviations. There are three major challenges for the accurate APIs recognition and linking mentioned APIs from unstructured natural language documents to an entry in the API repository: (1) distinguishing API…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
