KCIF: Knowledge-Conditioned Instruction Following

Rudra Murthy; Praveen Venkateswaran; Prince Kumar; Danish Contractor

arXiv:2410.12972·cs.CL·May 26, 2025

TL;DR

This paper investigates how well large language models follow instructions that are combined with knowledge tasks. It finds significant performance drops, especially in smaller models, and introduces a benchmark to evaluate this interaction.

Contribution

The paper introduces a benchmark dataset and evaluation framework for studying the interaction between knowledge and instruction following in LLMs, and highlights the difficulty models have when both are required together.

Findings

01

Models show a 40-50% performance drop on combined tasks.

02

Smaller models experience performance drops exceeding 80%.

03

Large models still struggle significantly with instruction-knowledge interaction.

Abstract

LLM evaluation benchmarks have traditionally separated the testing of knowledge/reasoning capabilities from instruction following. In this work, we study the interaction between knowledge and instruction following, and observe that LLMs struggle to follow simple answer-modifying instructions and are also distracted by instructions that should have no bearing on the original knowledge task answer. We leverage existing multiple-choice knowledge benchmarks and apply a set of simple instructions, which include manipulating text (e.g., change case), manipulating numeric quantities (e.g., increase value, change formatting), operating on lists (e.g., sort answer candidates), and distractor instructions (e.g., change case of numeric answers). We evaluate models at varying parameter sizes (1B-405B) from different model families and find that, surprisingly, all models report a significant drop in…
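To make the instruction categories in the abstract concrete, here is a minimal Python sketch of how a gold answer from a multiple-choice knowledge task might be transformed by an answer-modifying instruction, and how a distractor instruction should leave it unchanged. This is an illustration only, not the benchmark's implementation: the instruction names (`uppercase`, `increment`), the helper functions, and the exact-match scoring rule are all assumptions made for this example.

```python
import re
from typing import List

# Hypothetical helpers for illustration; names and behavior are assumptions,
# not taken from the KCIF code base.

_NUMERIC = re.compile(r"^-?\d+(\.\d+)?$")


def is_numeric(answer: str) -> bool:
    """Return True if the answer looks like a plain integer or decimal."""
    return bool(_NUMERIC.match(answer.strip()))


def sort_candidates(candidates: List[str]) -> List[str]:
    """List operation: sort the answer candidates alphabetically."""
    return sorted(candidates)


def expected_answer(gold: str, instruction: str) -> str:
    """Compute the answer a model should give after applying an instruction."""
    if instruction == "uppercase":
        # On a numeric answer this acts as a distractor: there is no case
        # to change, so the original answer must be returned unchanged.
        return gold if is_numeric(gold) else gold.upper()
    if instruction == "increment":
        # Numeric manipulation: increase an integer-valued answer by one.
        return str(int(gold) + 1)
    raise ValueError(f"unknown instruction: {instruction}")


def is_correct(model_output: str, target: str) -> bool:
    """Assumed scoring rule: exact match after trimming whitespace."""
    return model_output.strip() == target


if __name__ == "__main__":
    print(expected_answer("Paris", "uppercase"))
    print(expected_answer("42", "uppercase"))
    print(expected_answer("42", "increment"))
    print(sort_candidates(["pear", "apple", "fig"]))
```

Under this sketch, a model that answers the knowledge question correctly but ignores the instruction (or that alters a numeric answer in response to a case-change distractor) would be scored as wrong, which is the kind of interaction the abstract describes.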

Code & Models

Repositories

ibm/KCIF (official)

Taxonomy

Topics: Intelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Natural Language Processing Techniques

Methods: Softmax · Attention Is All You Need · Focus