Hot or Cold? Adaptive Temperature Sampling for Code Generation with   Large Language Models

Yuqi Zhu; Jia Li; Ge Li; YunFei Zhao; Jia Li; Zhi Jin; Hong Mei

arXiv:2309.02772·cs.SE·December 29, 2023·2 cites

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models

Yuqi Zhu, Jia Li, Ge Li, YunFei Zhao, Jia Li, Zhi Jin, Hong Mei

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Adaptive Temperature sampling, a novel decoding strategy for code generation with large language models that dynamically adjusts sampling temperature based on token difficulty, improving performance.

Contribution

It is the first systematic study of decoding strategies tailored for code generation, proposing a dynamic temperature adjustment method based on token difficulty analysis.

Findings

01

AdapT sampling outperforms existing decoding strategies.

02

Challenging tokens mainly occur at the beginning of code blocks.

03

Dynamic temperature adjustment improves code generation quality.

Abstract

Recently, Large Language Models (LLMs) have shown impressive abilities in code generation. However, existing LLMs' decoding strategies are designed for Natural Language (NL) generation, overlooking the differences between NL and programming languages (PL). Due to this oversight, a better decoding strategy for code generation remains an open question. In this paper, we conduct the first systematic study to explore a decoding strategy specialized in code generation. With an analysis of loss distributions of code tokens, we find that code tokens can be divided into two categories: challenging tokens that are difficult to predict and confident tokens that can be easily inferred. Among them, the challenging tokens mainly appear at the beginning of a code block. Inspired by the above findings, we propose a simple yet effective method: Adaptive Temperature (AdapT) sampling, which dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lj2lijia/adapt
mindsporeOfficial

Videos

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models· underline

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques