On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation
Gal Patel, Leshem Choshen, Omri Abend

TL;DR
This paper investigates how neural machine translation models encode sentence structure, revealing that superficial cues influence neuron activations and demonstrating partial control over syntactic output through neuron manipulation.
Contribution
The study introduces a model-agnostic methodology to analyze sentence structure encoding in neural translation models and develops a semi-automatic paraphrase generation technique.
Findings
Neuron activation similarity is mainly due to word choice and sentence length.
Neuron manipulation can influence syntactic form of translation output.
Deep models encode sentence-structure distinctions despite neuron-level evidence being limited.
Abstract
We present a methodology that explores how sentence structure is reflected in neural representations of machine translation systems. We demonstrate our model-agnostic approach with the Transformer English-German translation model. We analyze neuron-level correlation of activations between paraphrases while discussing the methodology challenges and the need for confound analysis to isolate the effects of shallow cues. We find that similarity between activation patterns can be mostly accounted for by similarity in word choice and sentence length. Following that, we manipulate neuron activations to control the syntactic form of the output. We show this intervention to be somewhat successful, indicating that deep models capture sentence-structure distinctions, despite finding no such indication at the neuron level. To conduct our experiments, we develop a semi-automatic method to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Label Smoothing · Softmax · Dense Connections · Absolute Position Encodings · Residual Connection
