Attention Mechanism and Heuristic Approach: Context-Aware File Ranking Using Multi-Head Self-Attention
Pradeep Kumar Sharma, Shantanu Godbole, Sarada Prasad Jena, Hritvik Shrivastava

TL;DR
This paper introduces a context-aware file ranking method using Multi-Head Self-Attention to improve recall in change impact analysis, effectively modeling feature dependencies and enhancing expert-like reasoning.
Contribution
It applies Multi-Head Self-Attention as a post-processing step to refine deterministic scores, capturing contextual feature relationships and improving recall in impacted file identification.
Findings
Top-50 recall improved from 62-65% to 78-82%.
Achieved 80% recall at Top-50 files.
Expert validation score increased from 6.5/10 to 8.6/10.
Abstract
The identification and ranking of impacted files within software reposi-tories is a key challenge in change impact analysis. Existing deterministic approaches that combine heuristic signals, semantic similarity measures, and graph-based centrality metrics have demonstrated effectiveness in nar-rowing candidate search spaces, yet their recall plateaus. This limitation stems from the treatment of features as linearly independent contributors, ignoring contextual dependencies and relationships between metrics that characterize expert reasoning patterns. To address this limitation, we propose the application of Multi-Head Self-Attention as a post-deterministic scoring refinement mechanism. Our approach learns contextual weighting between features, dynamically adjust-ing importance levels per file based on relational behavior exhibited across candidate file sets. The attention mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Scientific Computing and Data Management
