KCIF: Knowledge-Conditioned Instruction Following
Rudra Murthy, Praveen Venkateswaran, Prince Kumar, Danish Contractor

TL;DR
This paper investigates how large language models struggle to follow combined knowledge and instruction tasks, revealing significant performance drops especially in smaller models, and introduces a benchmark to evaluate this interaction.
Contribution
The paper introduces a new benchmark dataset and evaluation framework to study the interaction between knowledge and instruction following in LLMs, highlighting their joint challenges.
Findings
Models show a 40-50% performance drop on combined tasks.
Smaller models experience performance drops exceeding 80%.
Large models still struggle significantly with instruction-knowledge interaction.
Abstract
LLM evaluation benchmarks have traditionally separated the testing of knowledge/reasoning capabilities from instruction following. In this work, we study the interaction between knowledge and instruction following, and observe that LLMs struggle to follow simple answer modifying instructions, and are also distracted by instructions that should have no bearing on the original knowledge task answer. We leverage existing multiple-choice answer based knowledge benchmarks and apply a set of simple instructions which include manipulating text (eg.: change case), numeric quantities (eg.: increase value, change formatting), operate on lists (eg.: sort answer candidates) and distractor instructions (eg.: change case of numeric answers). We evaluate models at varying parameter sizes (1B-405B) from different model families and find that, surprisingly, all models report a significant drop in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Natural Language Processing Techniques
MethodsSoftmax · Attention Is All You Need · Focus
