Connecting Humanities and Social Sciences: Applying Language and Speech Technology to Online Panel Surveys

Henk van den Heuvel; Martijn Bentum; Simone Wills; Judith C. Koops

arXiv:2302.10593·cs.CL·February 22, 2023·1 cites

Open Access

TL;DR

This study investigates the use of speech recognition and transformer-based models to analyze open-ended survey responses, comparing spoken and typed answers to evaluate accuracy and feasibility in social science research.

Contribution

It demonstrates the application of speech technology and transformer models to analyze open-ended survey data, highlighting their potential and limitations.

Findings

01. ASR errors impact downstream analysis accuracy

02. Transformer models perform well without target-specific training

03. Spoken responses can be effectively analyzed with current technology

Abstract

In this paper, we explore the application of language and speech technology to open-ended questions in a Dutch panel survey. In an experimental wave, respondents could choose to answer open questions via speech or keyboard. Automatic speech recognition (ASR) was used to process spoken responses. We evaluated answers from these input modalities to investigate differences between spoken and typed answers. We report the errors the ASR system produces and investigate the impact of these errors on downstream analyses. Open-ended questions give respondents more freedom to answer, but entail a non-trivial amount of work to analyse. We evaluated the feasibility of using transformer-based models (e.g. BERT) to apply sentiment analysis and topic modelling to the answers to open questions. A big advantage of transformer-based models is that they are trained on a large amount of language…
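
The abstract mentions reporting ASR errors but, as excerpted here, does not say which metric is used. A common choice is word error rate (WER): the word-level edit distance between a human reference transcript and the ASR output, divided by the number of reference words. The following is a minimal sketch under that assumption; the function name `word_error_rate` and the Dutch example sentences are hypothetical illustrations, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: whitespace tokenisation and lowercasing are sufficient
    normalisation; a real evaluation may also strip punctuation.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: the ASR output drops one of four reference words.
manual_transcript = "ik ben heel tevreden"
asr_output = "ik ben tevreden"
wer = word_error_rate(manual_transcript, asr_output)
```

Because WER is normalised by reference length, it can exceed 1.0 when the ASR output contains many inserted words, which is worth keeping in mind for very short spoken answers.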

Taxonomy

Topics: Survey Methodology and Nonresponse · Expert finding and Q&A systems · Speech and dialogue systems