Predicting Sentence Acceptability Judgments in Multimodal Contexts

Hyewon Jang; Nikolai Ilinykh; Sharid Lo\'aiciga; Jey Han Lau; Shalom Lappin

arXiv:2602.20918·cs.AI·February 25, 2026

Predicting Sentence Acceptability Judgments in Multimodal Contexts

Hyewon Jang, Nikolai Ilinykh, Sharid Lo\'aiciga, Jey Han Lau, Shalom Lappin

PDF

Open Access

TL;DR

This study investigates how visual context influences sentence acceptability judgments in humans and large language models, revealing minimal impact on humans but notable effects on model predictions and internal representations.

Contribution

It demonstrates that visual images have little effect on human judgments but significantly influence LLM predictions and internal representations, highlighting differences in multimodal processing.

Findings

01

Humans' acceptability ratings are unaffected by visual context.

02

LLMs' predictions are slightly better without visual context.

03

Model judgments vary, with Qwen resembling human patterns.

Abstract

Previous work has examined the capacity of deep neural networks (DNNs), particularly transformers, to predict human sentence acceptability judgments, both independently of context, and in document contexts. We consider the effect of prior exposure to visual images (i.e., visual context) on these judgments for humans and large language models (LLMs). Our results suggest that, in contrast to textual context, visual images appear to have little if any impact on human acceptability ratings. However, LLMs display the compression effect seen in previous work on human judgments in document contexts. Different sorts of LLMs are able to predict human acceptability judgments to a high degree of accuracy, but in general, their performance is slightly better when visual contexts are removed. Moreover, the distribution of LLM judgments varies among models, with Qwen resembling human patterns, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Neurobiology of Language and Bilingualism