Toward Neurosymbolic Program Comprehension
Alejandro Velasco, Aya Garryyeva, David N. Palacio, Antonio, Mastropaolo, Denys Poshyvanyk

TL;DR
This paper advocates for a Neurosymbolic approach to program comprehension, combining deep learning models with symbolic methods to improve reliability, interpretability, and efficiency in software engineering tasks.
Contribution
It introduces the concept of Neurosymbolic Program Comprehension (NsPC), proposing a hybrid framework that integrates neural and symbolic techniques for better code analysis.
Findings
Preliminary results show promise for the hybrid approach.
The framework aims to enhance trustworthiness and interpretability.
Challenges in scaling large models motivate the Neurosymbolic direction.
Abstract
Recent advancements in Large Language Models (LLMs) have paved the way for Large Code Models (LCMs), enabling automation in complex software engineering tasks, such as code generation, software testing, and program comprehension, among others. Tools like GitHub Copilot and ChatGPT have shown substantial benefits in supporting developers across various practices. However, the ambition to scale these models to trillion-parameter sizes, exemplified by GPT-4, poses significant challenges that limit the usage of Artificial Intelligence (AI)-based systems powered by large Deep Learning (DL) models. These include rising computational demands for training and deployment and issues related to trustworthiness, bias, and interpretability. Such factors can make managing these models impractical for many organizations, while their "black-box'' nature undermines key aspects, including transparency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience, Education and Cognitive Function · Educational and Psychological Assessments
MethodsAttention Is All You Need · Label Smoothing · Layer Normalization · Linear Layer · Byte Pair Encoding · Dense Connections · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam
