Subliminal Effects in Your Data: A General Mechanism via Log-Linearity

Ishaq Aden-Ali; Noah Golowich; Allen Liu; Abhishek Shetty; Ankur Moitra; Nika Haghtalab

arXiv:2602.04863 · cs.LG · February 5, 2026

Open Access

TL;DR

This paper introduces Logit-Linear-Selection (LLS), a method for selecting subsets of a generic preference dataset that elicit hidden effects: changes in a large language model's behavior that are not directly observable from individual data points.

Contribution

The paper proposes LLS, a novel method to select data subsets that induce hidden behaviors in LLMs, advancing understanding of dataset effects on model properties.

Findings

01 LLS can elicit specific preferences in models

02 Models respond to prompts in new languages due to dataset effects

03 Hidden behaviors persist across different model architectures

Abstract

Training modern large language models (LLMs) has become a veritable smorgasbord of algorithms and datasets designed to elicit particular behaviors, making it critical to develop techniques to understand the effects of datasets on the model's properties. This is exacerbated by recent experiments that show datasets can transmit signals that are not directly observable from individual datapoints, posing a conceptual challenge for dataset-centric understandings of LLM training and suggesting a missing fundamental account of such phenomena. Towards understanding such effects, inspired by recent work on the linear structure of LLMs, we uncover a general mechanism through which hidden subtexts can arise in generic datasets. We introduce Logit-Linear-Selection (LLS), a method that prescribes how to select subsets of a generic preference dataset to elicit a wide range of hidden effects. We…

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)