TriagerX: Dual Transformers for Bug Triaging Tasks with Content and Interaction Based Rankings
Md Afif Al Mamun, Gias Uddin, Lan Xia, Longyu Zhang

TL;DR
TriagerX introduces a dual-transformer architecture combined with interaction-based ranking to improve bug triaging accuracy, outperforming state-of-the-art methods across multiple datasets and successfully deployed in industry.
Contribution
The paper proposes TriagerX, a novel dual-transformer model with interaction-based ranking for bug triaging, enhancing semantic understanding and incorporating developer interaction history.
Findings
TriagerX outperforms nine transformer-based baselines in Top-1 and Top-3 accuracy.
Achieves over 10% improvement in developer recommendation accuracy.
Successfully deployed in an industrial environment for bug and component recommendations.
Abstract
Pretrained Language Models or PLMs are transformer-based architectures that can be used in bug triaging tasks. PLMs can better capture token semantics than traditional Machine Learning (ML) models that rely on statistical features (e.g., TF-IDF, bag of words). However, PLMs may still attend to less relevant tokens in a bug report, which can impact their effectiveness. In addition, the model can be sub-optimal with its recommendations when the interaction history of developers around similar bugs is not taken into account. We designed TriagerX to address these limitations. First, to assess token semantics more reliably, we leverage a dual-transformer architecture. Unlike current state-of-the-art (SOTA) baselines that employ a single transformer architecture, TriagerX collects recommendations from two transformers with each offering recommendations via its last three layers. This setup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
