Perceptimatic: A human speech perception benchmark for unsupervised   subword modelling

Juliette Millet; Ewan Dunbar

arXiv:2010.05961·cs.CL·October 14, 2020

Perceptimatic: A human speech perception benchmark for unsupervised subword modelling

Juliette Millet, Ewan Dunbar

PDF

1 Repo

TL;DR

This paper introduces Perceptimatic, a benchmark dataset for comparing human speech perception with models on phone discrimination tasks, highlighting differences between model and human perceptual spaces.

Contribution

It provides a new open dataset and a method to compare human and model speech perception, applied to existing models from the Zero Resource Speech Challenge.

Findings

01

Supervised monolingual HMM-GMM models differ from human perceptual space.

02

Unsupervised and multilingual models show different perceptual representations.

03

The dataset enables detailed comparison of human and model speech perception.

Abstract

In this paper, we present a data set and methods to compare speech processing models and human behaviour on a phone discrimination task. We provide Perceptimatic, an open data set which consists of French and English speech stimuli, as well as the results of 91 English- and 93 French-speaking listeners. The stimuli test a wide range of French and English contrasts, and are extracted directly from corpora of natural running read speech, used for the 2017 Zero Resource Speech Challenge. We provide a method to compare humans' perceptual space with models' representational space, and we apply it to models previously submitted to the Challenge. We show that, unlike unsupervised models and supervised multilingual models, a standard supervised monolingual HMM-GMM phone recognition system, while good at discriminating phones, yields a representational space very different from that of human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JAMJU/interspeech-2020-perceptimatic
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.