Toward Neurosymbolic Program Comprehension

Alejandro Velasco; Aya Garryyeva; David N. Palacio; Antonio; Mastropaolo; Denys Poshyvanyk

arXiv:2502.01806·cs.SE·February 5, 2025·2 cites

Toward Neurosymbolic Program Comprehension

Alejandro Velasco, Aya Garryyeva, David N. Palacio, Antonio, Mastropaolo, Denys Poshyvanyk

PDF

Open Access

TL;DR

This paper advocates for a Neurosymbolic approach to program comprehension, combining deep learning models with symbolic methods to improve reliability, interpretability, and efficiency in software engineering tasks.

Contribution

It introduces the concept of Neurosymbolic Program Comprehension (NsPC), proposing a hybrid framework that integrates neural and symbolic techniques for better code analysis.

Findings

01

Preliminary results show promise for the hybrid approach.

02

The framework aims to enhance trustworthiness and interpretability.

03

Challenges in scaling large models motivate the Neurosymbolic direction.

Abstract

Recent advancements in Large Language Models (LLMs) have paved the way for Large Code Models (LCMs), enabling automation in complex software engineering tasks, such as code generation, software testing, and program comprehension, among others. Tools like GitHub Copilot and ChatGPT have shown substantial benefits in supporting developers across various practices. However, the ambition to scale these models to trillion-parameter sizes, exemplified by GPT-4, poses significant challenges that limit the usage of Artificial Intelligence (AI)-based systems powered by large Deep Learning (DL) models. These include rising computational demands for training and deployment and issues related to trustworthiness, bias, and interpretability. Such factors can make managing these models impractical for many organizations, while their "black-box'' nature undermines key aspects, including transparency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeuroscience, Education and Cognitive Function · Educational and Psychological Assessments

MethodsAttention Is All You Need · Label Smoothing · Layer Normalization · Linear Layer · Byte Pair Encoding · Dense Connections · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam