Investigating Neurons and Heads in Transformer-based LLMs for Typographical Errors

Kohei Tsuji; Tatsuya Hiraoka; Yuchang Cheng; Eiji Aramaki; Tomoya Iwakura

arXiv:2502.19669·cs.CL·February 28, 2025

Open Access

TL;DR

This study explores how transformer-based large language models internally recognize and correct typographical errors, revealing specific neurons and attention heads responsible for typo detection and correction across different layers.

Contribution

The paper introduces a method to identify neurons and heads that detect and fix typos, providing insights into internal mechanisms of LLMs for typo correction.

Findings

01 LLMs can fix typos from local context when typo neurons in either the early or the late layers are active.

02 Typo neurons in the middle layers carry the core of typo-fixing that relies on global context.

03 Typo heads attend broadly to the context rather than to specific tokens.

Abstract

This paper investigates how LLMs encode inputs with typos. We hypothesize that specific neurons and attention heads recognize typos and fix them internally using local and global contexts. We introduce a method to identify typo neurons and typo heads that work actively when inputs contain typos. Our experimental results suggest the following: 1) LLMs can fix typos with local contexts when the typo neurons in either the early or late layers are activated, even if those in the other are not. 2) Typo neurons in the middle layers are responsible for the core of typo-fixing with global contexts. 3) Typo heads fix typos by considering the context broadly rather than focusing on specific tokens. 4) Typo neurons and typo heads work not only for typo-fixing but also for understanding general contexts.
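
The abstract describes typo neurons as neurons that "work actively when inputs contain typos" but this page does not state the paper's scoring criterion. The sketch below is only one plausible reading, not the authors' method: it assumes per-neuron activations have already been collected for matched clean and typo inputs, and ranks neurons by how much their mean activation rises on the typo inputs. The function name `rank_typo_neurons`, the data layout, and the toy numbers are all hypothetical.

```python
from statistics import mean


def rank_typo_neurons(clean_acts, typo_acts, top_k=2):
    """Return indices of the top_k neurons whose mean activation
    increases most on typo inputs.

    clean_acts, typo_acts: lists of activation vectors (one per input),
    all of the same length. This layout is an assumption for illustration.
    """
    n_neurons = len(clean_acts[0])
    scores = []
    for j in range(n_neurons):
        clean_mean = mean(vec[j] for vec in clean_acts)
        typo_mean = mean(vec[j] for vec in typo_acts)
        # Score = activation increase attributable to the typo condition.
        scores.append((typo_mean - clean_mean, j))
    # Largest increase first.
    scores.sort(key=lambda s: s[0], reverse=True)
    return [j for _, j in scores[:top_k]]


# Toy activations for 4 neurons over 2 inputs (made-up numbers).
clean = [[0.1, 0.5, 0.0, 0.2], [0.2, 0.4, 0.1, 0.2]]
typo = [[0.9, 0.5, 0.1, 0.1], [0.8, 0.6, 0.6, 0.3]]

top = rank_typo_neurons(clean, typo, top_k=2)
print(top)
```

A difference of means is the simplest possible score; a real analysis would likely need many inputs per condition and some control for neurons that are merely noisy.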

Taxonomy

Topics: Ferroelectric and Negative Capacitance Devices · Software Engineering Research · Neurobiology and Insect Physiology Research

Methods: Softmax · Attention Is All You Need