Inducing Character-level Structure in Subword-based Language Models with   Type-level Interchange Intervention Training

Jing Huang; Zhengxuan Wu; Kyle Mahowald; and Christopher Potts

arXiv:2212.09897·cs.CL·December 20, 2023

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

Jing Huang, Zhengxuan Wu, Kyle Mahowald, and Christopher Potts

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal intervention framework to improve subword-based language models' ability to handle character-level tasks by learning interpretable character representations, enhancing robustness and performance on complex form-meaning tasks.

Contribution

It develops a novel causal intervention method for character representation learning within subword models and introduces new character-level tasks to evaluate their capabilities.

Findings

01

Outperforms standard models on complex character tasks

02

Improves robustness to unseen token sequences

03

Produces human-interpretable character representations

Abstract

Language tasks involving character-level manipulations (e.g., spelling corrections, arithmetic operations, word games) are challenging for models operating on subword units. To address this, we develop a causal intervention framework to learn robust and interpretable character representations inside subword-based language models. Our method treats each character as a typed variable in a causal model and learns such causal structures by adapting the interchange intervention training method of Geiger et al. (2021). We additionally introduce a suite of character-level tasks that systematically vary in their dependence on meaning and sequence-level context. While character-level models still perform best on purely form-based tasks like string reversal, our method outperforms character-level models on more complex tasks that blend form, meaning, and context, such as spelling correction in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

explanare/char-iit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification