HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection
Guang Yang, Yujie Zhu

TL;DR
This paper introduces HGAdapter, a hypergraph-based adapter that captures high-order data correlations in code to enhance pre-trained language models for code summarization and clone detection tasks.
Contribution
The paper proposes a novel hypergraph-based adapter that encodes high-order code data correlations and can be integrated into various PLMs for improved performance.
Findings
HGAdapter improves PLM performance on code tasks.
High-order data correlations enhance code understanding.
Experiments across multiple datasets validate effectiveness.
Abstract
Pre-trained language models (PLMs) are increasingly being applied to code-related tasks. Although PLMs have achieved good results, they do not take into account potential high-order data correlations within the code. We propose three types of high-order correlations in code tokens, i.e. abstract syntax tree family correlation, lexical correlation, and line correlation. We design a tokens and hyperedges generator to capture these high-order data correlations. We improve the architecture of hypergraph neural networks and combine it with adapter tuning to propose a novel hypergraph-based adapter (HGAdapter) to fine-tune PLMs. HGAdapter can encode high-order data correlations and is allowed to be inserted into various PLMs to enhance performance. Experiments were conducted on several public datasets, including six languages of code summarization and code clone detection tasks. Our methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Authorship Attribution and Profiling
