MMF3: Neural Code Summarization Based on Multi-Modal Fine-Grained Feature Fusion
Zheng Ma, Yuexiu Gao, Lei Lyu, Chen Lyu

TL;DR
This paper introduces MMF3, a novel neural code summarization model that employs fine-grained multi-modal feature fusion at token and node levels, significantly enhancing summary quality over existing methods.
Contribution
The study proposes a new fine-grained fusion approach for multi-modal code features, improving the alignment and integration of semantic and syntactic information for better code summarization.
Findings
Outperforms state-of-the-art models on Java and Python datasets.
Fine-grained fusion effectively improves summary accuracy.
Ablation studies confirm the importance of the proposed fusion method.
Abstract
Background: Code summarization automatically generates the corresponding natural language descriptions according to the input code. Comprehensiveness of code representation is critical to code summarization task. However, most existing approaches typically use coarse-grained fusion methods to integrate multi-modal features. They generally represent different modalities of a piece of code, such as an Abstract Syntax Tree (AST) and a token sequence, as two embeddings and then fuse the two ones at the AST/code levels. Such a coarse integration makes it difficult to learn the correlations between fine-grained code elements across modalities effectively. Aims: This study intends to improve the model's prediction performance for high-quality code summarization by accurately aligning and fully fusing semantic and syntactic structure information of source code at node/token levels. Method: This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Natural Language Processing Techniques · Topic Modeling
