The Role of Interpretable Patterns in Deep Learning for Morphology

Judit Acs; Andras Kornai

arXiv:2012.04575·cs.CL·December 9, 2020

The Role of Interpretable Patterns in Deep Learning for Morphology

Judit Acs, Andras Kornai

PDF

Open Access 1 Repo

TL;DR

This paper explores how character pattern recognition within a modified sequence-to-sequence model can improve understanding of morphological analysis, lemmatization, and copying tasks across multiple languages.

Contribution

It introduces a pattern matching encoder that identifies important subwords for different tasks and compares their roles using a novel similarity metric across languages.

Findings

01

Patterns reveal task-specific subword importance.

02

Similarity scores show relationships between tasks.

03

Method enhances interpretability of deep models.

Abstract

We examine the role of character patterns in three tasks: morphological analysis, lemmatization and copy. We use a modified version of the standard sequence-to-sequence model, where the encoder is a pattern matching network. Each pattern scores all possible N character long subwords (substrings) on the source side, and the highest scoring subword's score is used to initialize the decoder as well as the input to the attention mechanism. This method allows learning which subwords of the input are important for generating the output. By training the models on the same source but different target, we can compare what subwords are important for different tasks and how they relate to each other. We define a similarity metric, a generalized form of the Jaccard similarity, and assign a similarity score to each pair of the three tasks that work on the same source but may differ in target. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

juditacs/deep-morphology
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Biomedical Text Mining and Ontologies