Neural FOXP2 -- Language Specific Neuron Steering for Targeted Language Improvement in LLMs
Anusa Saha, Tanmay Joshi, Vinija Jain, Aman Chadha, Amitava Das

TL;DR
This paper introduces Neural FOXP2, a method to selectively steer language-specific neurons in multilingual large language models, enabling targeted language defaultness control for languages like Hindi and Spanish.
Contribution
It presents a novel three-stage approach combining localization, spectral analysis, and targeted activation shifts to control language defaultness in LLMs.
Findings
Successfully localizes language neurons in LLMs.
Identifies stable spectral directions for language steering.
Achieves controllable language defaultness in experiments.
Abstract
LLMs are multilingual by training, yet their lingua franca is often English, reflecting English language dominance in pretraining. Other languages remain in parametric memory but are systematically suppressed. We argue that language defaultness is governed by a sparse, low-rank control circuit, language neurons, that can be mechanistically isolated and safely steered. We introduce Neural FOXP2, that makes a chosen language (Hindi or Spanish) primary in a model by steering language-specific neurons. Neural FOXP2 proceeds in three stages: (i) Localize: We train per-layer SAEs so each activation decomposes into a small set of active feature components. For every feature, we quantify English vs. Hindi/Spanish selectivity overall logit-mass lift toward the target-language token set. Tracing the top-ranked features back to their strongest contributing units yields a compact language-neuron…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Neurobiology of Language and Bilingualism · Neural Networks and Applications
