HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection

Guang Yang; Yujie Zhu

arXiv:2510.17591·cs.CL·November 27, 2025

HGAdapter: Hypergraph-based Adapters in Language Models for Code Summarization and Clone Detection

Guang Yang, Yujie Zhu

PDF

Open Access

TL;DR

This paper introduces HGAdapter, a hypergraph-based adapter that captures high-order data correlations in code to enhance pre-trained language models for code summarization and clone detection tasks.

Contribution

The paper proposes a novel hypergraph-based adapter that encodes high-order code data correlations and can be integrated into various PLMs for improved performance.

Findings

01

HGAdapter improves PLM performance on code tasks.

02

High-order data correlations enhance code understanding.

03

Experiments across multiple datasets validate effectiveness.

Abstract

Pre-trained language models (PLMs) are increasingly being applied to code-related tasks. Although PLMs have achieved good results, they do not take into account potential high-order data correlations within the code. We propose three types of high-order correlations in code tokens, i.e. abstract syntax tree family correlation, lexical correlation, and line correlation. We design a tokens and hyperedges generator to capture these high-order data correlations. We improve the architecture of hypergraph neural networks and combine it with adapter tuning to propose a novel hypergraph-based adapter (HGAdapter) to fine-tune PLMs. HGAdapter can encode high-order data correlations and is allowed to be inserted into various PLMs to enhance performance. Experiments were conducted on several public datasets, including six languages of code summarization and code clone detection tasks. Our methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Authorship Attribution and Profiling