AVIATE: Exploiting Translation Variants of Artifacts to Improve IR-based Traceability Recovery in Bilingual Software Projects
Kexin Sun, Yiding Ren, Hongyu Kuang, Hui Gao, Xiaoxing Ma, Guoping Rong, Dong Shao, He Zhang

TL;DR
AVIATE improves IR-based traceability recovery in bilingual software projects by using translation variants from multiple translators to reduce term inconsistency and enrich artifact texts, leading to significant performance gains.
Contribution
The paper introduces AVIATE, a novel method that leverages translation variants from multiple translators to improve IR-based traceability recovery in bilingual projects.
Findings
AVIATE outperforms state-of-the-art IR approaches with an average 31.43% increase in precision.
The approach significantly improves traceability recovery in multilingual software projects.
Experiments on 17 bilingual projects validate AVIATE's effectiveness.
Abstract
Traceability plays a vital role in facilitating various software development activities by establishing the traces between different types of artifacts (e.g., issues and commits in software repositories). Among the explorations for automated traceability recovery, the IR (Information Retrieval)-based approaches leverage textual similarity to measure the likelihood of traces between artifacts and show advantages in many scenarios. However, the globalization of software development has introduced new challenges, such as the possible multilingualism on the same concept (e.g., "ShuXing" vs. "attribute") in the artifact texts, thus significantly hampering the performance of IR-based approaches. Existing research has shown that machine translation can help address the term inconsistency in bilingual projects. However, the translation can also bring in synonymous terms that are not consistent…
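To make the term-inconsistency problem in the abstract concrete, here is a minimal sketch of generic IR-based trace ranking using TF-IDF and cosine similarity. It is not AVIATE's method: the function names, the smoothing formula, and the sample issue and commit texts are all assumptions chosen for illustration. Only the "ShuXing" vs. "attribute" pair comes from the abstract.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; real pipelines also split identifiers and stem.
    return text.lower().split()


def tfidf_vectors(docs):
    # Build one sparse TF-IDF vector (dict of term -> weight) per document.
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed IDF (an assumed variant) so every weight stays positive.
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(query, candidates):
    # Score each candidate artifact against the query, highest similarity first.
    vecs = tfidf_vectors([query] + candidates)
    scored = [(cosine(vecs[0], v), c) for v, c in zip(vecs[1:], candidates)]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical artifacts: the issue uses the pinyin term, the commit the English one.
commits = ["guard attribute lookup", "update build script"]
before = rank_candidates("ShuXing loader crash", commits)
after = rank_candidates("attribute loader crash", commits)
```

In this toy setup the pinyin issue text shares no terms with either commit, so every candidate scores zero. Once "ShuXing" is rendered as "attribute", the related commit gains a nonzero score and ranks first. A different translator could just as well output a synonym such as "property", which would miss again; that is the inconsistency the abstract says translation can introduce.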
Taxonomy
Topics: Software Reliability and Analysis Research · Software Engineering Research
