Can Large Language Models Understand Context?

Yilun Zhu; Joel Ruben Antony Moniz; Shruti Bhargava; Jiarui Lu; Dhivya; Piraviperumal; Site Li; Yuan Zhang; Hong Yu; Bo-Hsiang Tseng

arXiv:2402.00858·cs.CL·February 2, 2024·6 cites

Can Large Language Models Understand Context?

Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya, Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng

PDF

Open Access

TL;DR

This paper introduces a benchmark to evaluate large language models' ability to understand context, revealing that fine-tuned models outperform pre-trained dense models, and quantization impacts performance variably.

Contribution

It presents a new benchmark with four tasks and nine datasets for assessing contextual understanding in LLMs, including evaluations of pretraining and quantization effects.

Findings

01

Fine-tuned models outperform dense pre-trained models in context understanding.

02

Quantization, especially at 3-bit, reduces models' contextual performance.

03

Extensive analysis confirms the impact of model type and compression on understanding ability.

Abstract

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various domains within the realm of Natural Language Processing, limited attention has been paid to probing their linguistic capability of understanding contextual features. This paper introduces a context understanding benchmark by adapting existing datasets to suit the evaluation of generative models. This benchmark comprises of four distinct tasks and nine datasets, all featuring prompts designed to assess the models' ability to understand context. First, we evaluate the performance of LLMs under the in-context learning pretraining scenario. Experimental results indicate that pre-trained dense models struggle with understanding more nuanced contextual features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques