Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication

Gašper Beguš

arXiv:2009.06110·cs.CL·November 23, 2021

TL;DR

This paper shows that deep convolutional neural networks trained on raw speech can learn an identity-based pattern, reduplication, without supervision, and that the pattern can be controlled by manipulating the network's latent space. The results bear on how such networks represent linguistic patterns internally.

Contribution

It proposes a technique to wug-test CNNs trained on speech and shows that manipulating two categorical latent variables can turn an unreduplicated output into a reduplicated one.

Findings

1. CNNs learn to represent identity-based patterns in latent space.
2. Manipulating latent variables can produce reduplicated speech forms.
3. The network generalizes the pattern to unobserved data.

Abstract

This paper models unsupervised learning of an identity-based pattern (or copying) in speech called reduplication from raw continuous data with deep convolutional neural networks. We use the ciwGAN architecture (Beguš 2021a; arXiv:2006.02951) in which learning of meaningful representations in speech emerges from a requirement that the CNNs generate informative data. We propose a technique to wug-test CNNs trained on speech and, based on four generative tests, argue that the network learns to represent an identity-based pattern in its latent space. By manipulating only two categorical variables in the latent space, we can actively turn an unreduplicated form into a reduplicated form with no other substantial changes to the output in the majority of cases. We also argue that the network extends the identity-based pattern to unobserved data. Exploration of how meaningful representations…
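
The latent manipulation described in the abstract can be illustrated with a minimal sketch. It only builds the latent vectors; no trained generator is included. Everything specific here is an assumption for illustration, not taken from the paper: the helper names `make_latent` and `set_code`, the noise dimension of 98, the uniform noise range, and which code value stands for the reduplicated form.

```python
import random

def make_latent(code, noise_dim=98, rng=None):
    """Concatenate a categorical code with uniform noise in [-1, 1].

    noise_dim=98 is an assumed size chosen for illustration.
    """
    rng = rng or random.Random(0)
    noise = [rng.uniform(-1.0, 1.0) for _ in range(noise_dim)]
    return list(code) + noise

def set_code(latent, new_code):
    """Return a copy of `latent` with only the leading code variables replaced."""
    k = len(new_code)
    return list(new_code) + list(latent[k:])

# Hypothetical assignment: which code marks which form is an assumption.
UNREDUPLICATED = [1.0, 0.0]
REDUPLICATED = [0.0, 1.0]

z_base = make_latent(UNREDUPLICATED)
z_redup = set_code(z_base, REDUPLICATED)

# The two vectors share every noise variable; only the two code variables differ.
n_changed = sum(a != b for a, b in zip(z_base, z_redup))
print(n_changed)
```

Passing `z_base` and `z_redup` to the same trained generator would give a pair of outputs that differ only in the two categorical variables, which is the kind of comparison the abstract describes.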

Code & Models

Repositories

gbegus/fiwGAN-ciwGAN (TensorFlow, official)