LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Lu\'isa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, and Sara Hooker

TL;DR
This paper investigates how synthetic data influences large language models' biases and attributes, and introduces active inheritance as a method to steer models towards desired non-differentiable properties through controlled data generation.
Contribution
The paper systematically studies the impact of synthetic data on LLMs and proposes active inheritance to explicitly guide model properties via data generation.
Findings
Models are sensitive to certain synthetic data attributes even when prompts are neutral.
Active inheritance can steer models towards desired non-differentiable attributes.
Synthetic data source significantly influences models' biases and textual attributes.
Abstract
The widespread adoption of synthetic data raises new questions about how models generating the data can influence other large language models (LLMs) via distilled data. To start, our work exhaustively characterizes the impact of passive inheritance of model properties by systematically studying the consequences of synthetic data integration. We provide one of the most comprehensive studies to-date of how the source of synthetic data shapes models' internal biases, calibration and generations' textual attributes and preferences. We find that models are surprisingly sensitive towards certain attributes even when the synthetic data prompts appear "neutral". which invites the question whether this sensitivity can be exploited for good. Our findings invite the question can we explicitly steer the models towards the properties we want at test time by exploiting the data generation process?…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing
