JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation

Yingbing Huang, Deming Chen, and Abhishek K. Umrawal

arXiv:2502.20684 · cs.CL · March 3, 2025

TL;DR

JAM introduces a causal reasoning framework that interprets and controls large language model outputs by manipulating latent vectors, significantly improving responsible and realistic text generation.

Contribution

This paper presents JAM, a novel causal reasoning approach that enhances interpretability and control in LLMs through latent vector manipulation, with improved efficiency and performance.

Findings

01. Up to 22% improvement over previous methods in quantitative metrics

02. Demonstrates greater computational efficiency

03. Achieves responsible and realistic text generation

Abstract

While large language models (LLMs) have made significant strides in generating coherent and contextually relevant text, they often function as opaque black boxes, trained on vast unlabeled datasets with statistical objectives, lacking an interpretable framework for responsible control. In this paper, we introduce JAM (Just A Move), a novel framework that interprets and controls text generation by integrating cause-effect analysis within the latent space of LLMs. Based on our observations, we uncover the inherent causality in LLM generation, which is critical for producing responsible and realistic outputs. Moreover, we explore latent vectors as fundamental components in LLM architectures, aiming to understand and manipulate them for more effective and efficient controllable text generation. We evaluate our framework using a range of tools, including the HHH criteria, toxicity reduction…
