# Detecting Syntactic Change Using a Neural Part-of-Speech Tagger

**Authors:** William Merrill, Gigi Felice Stark, Robert Frank

arXiv: 1906.01661 · 2019-07-11

## TL;DR

This paper trains a neural POS tagger on historical American English to detect syntactic changes over time, revealing that the model implicitly learns temporal and syntactic evolution across centuries.

## Contribution

It introduces a diachronic LSTM POS tagger that captures syntactic change and temporal information, demonstrating transferability to date new sentences.

## Key findings

- Year embeddings correlate with time
- Model predicts sentence dates effectively
- Temporal change is syntactic, not lexical

## Abstract

We train a diachronic long short-term memory (LSTM) part-of-speech tagger on a large corpus of American English from the 19th, 20th, and 21st centuries. We analyze the tagger's ability to implicitly learn temporal structure between years, and the extent to which this knowledge can be transferred to date new sentences. The learned year embeddings show a strong linear correlation between their first principal component and time. We show that temporal information encoded in the model can be used to predict novel sentences' years of composition relatively well. Comparisons to a feedforward baseline suggest that the temporal change learned by the LSTM is syntactic rather than purely lexical. Thus, our results suggest that our tagger is implicitly learning to model syntactic change in American English over the course of the 19th, 20th, and early 21st centuries.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.01661/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1906.01661/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/1906.01661/full.md

---
Source: https://tomesphere.com/paper/1906.01661