Enhancing Trust in Language Model-Based Code Optimization through RLHF:   A Research Design

Jingzhi Gong

arXiv:2502.06769·cs.SE·March 19, 2025

Enhancing Trust in Language Model-Based Code Optimization through RLHF: A Research Design

Jingzhi Gong

PDF

Open Access

TL;DR

This paper proposes a research design to improve trust in language model-based code optimization by integrating human feedback through reinforcement learning with human feedback (RLHF), addressing reliability issues like hallucinations.

Contribution

It introduces a novel research framework for enhancing trustworthiness of LMs in code optimization using RLHF, focusing on human-centric reliability improvements.

Findings

01

Framework for integrating human feedback into LM-based code optimization

02

Addressing hallucination issues in language models for software engineering

03

Lays groundwork for future empirical validation of trust-enhancing methods

Abstract

With the rapid advancement of AI, software engineering increasingly relies on AI-driven approaches, particularly language models (LMs), to enhance code performance. However, the trustworthiness and reliability of LMs remain significant challenges due to the potential for hallucinations - unreliable or incorrect responses. To fill this gap, this research aims to develop reliable, LM-powered methods for code optimization that effectively integrate human feedback. This work aligns with the broader objectives of advancing cooperative and human-centric aspects of software engineering, contributing to the development of trustworthy AI-driven solutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques