CodeT5-RNN: Reinforcing Contextual Embeddings for Enhanced Code Comprehension
Md Mostafizer Rahman, Ariful Islam Shiplu, Yutaka Watanobe, Md Faizul Ibne Amin, Syed Rameez Naqvi, Fang Liu

TL;DR
This paper introduces a hybrid LLM-RNN framework that refines contextual embeddings with RNNs to improve code comprehension, demonstrating significant accuracy improvements on defect detection benchmarks.
Contribution
The paper proposes a novel hybrid model combining LLMs with RNNs to enhance long-range dependency capture in code understanding tasks.
Findings
Hybrid models outperform standalone LLMs in defect detection accuracy.
Reprocessing embeddings with RNNs improves order-sensitive dependency modeling.
Models show consistent improvements across multiple real-world datasets.
Abstract
Contextual embeddings generated by LLMs exhibit strong positional inductive biases, which can limit their ability to fully capture long-range, order-sensitive dependencies in highly structured source code. Consequently, how to further refine and enhance LLM embeddings for improved code understanding remains an open research question. To address this gap, we propose a hybrid LLM-RNN framework that reinforces LLM-generated contextual embeddings with a sequential RNN architecture. The embeddings reprocessing step aims to reinforce sequential semantics and strengthen order-aware dependencies inherent in source code. We evaluate the proposed hybrid models on both benchmark and real-world coding datasets. The experimental results show that the RoBERTa-BiGRU and CodeBERT-GRU models achieved accuracies of 66.40% and 66.03%, respectively, on the defect detection benchmark dataset, representing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Misinformation and Its Impacts
