Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Pradyoth Hegde; Santosh Kesiraju; Jan \v{S}vec; \v{S}imon Sedl\'a\v{c}ek; Bolaji Yusuf; Old\v{r}ich Plchot; Deepak K T; Jan \v{C}ernock\'y

arXiv:2506.08753·cs.CL·June 11, 2025

Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Pradyoth Hegde, Santosh Kesiraju, Jan \v{S}vec, \v{S}imon Sedl\'a\v{c}ek, Bolaji Yusuf, Old\v{r}ich Plchot, Deepak K T, Jan \v{C}ernock\'y

PDF

Open Access

TL;DR

This paper investigates how various factors influence the effectiveness of in-context learning in large language models for dialogue state tracking, using a systematic approach with different models and demonstration selection methods.

Contribution

It introduces a systematic analysis of demonstration selection and prompt context factors affecting in-context learning performance in dialogue state tracking tasks.

Findings

01

Demonstration selection significantly impacts DST performance.

02

Prompt context length influences model accuracy.

03

Different models exhibit varying sensitivity to demonstration quality.

Abstract

This study explores the application of in-context learning (ICL) to the dialogue state tracking (DST) problem and investigates the factors that influence its effectiveness. We use a sentence embedding based k-nearest neighbour method to retrieve the suitable demonstrations for ICL. The selected demonstrations, along with the test samples, are structured within a template as input to the LLM. We then conduct a systematic study to analyse the impact of factors related to demonstration selection and prompt context on DST performance. This work is conducted using the MultiWoZ2.4 dataset and focuses primarily on the OLMo-7B-instruct, Mistral-7B-Instruct-v0.3, and Llama3.2-3B-Instruct models. Our findings provide several useful insights on in-context learning abilities of LLMs for dialogue state tracking.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning

MethodsDynamic Sparse Training