JSSS: free Japanese speech corpus for summarization and simplification

Shinnosuke Takamichi; Mamoru Komachi; Naoko Tanji; Hiroshi Saruwatari

arXiv:2010.01793 · eess.AS · October 6, 2020

TL;DR

This paper introduces JSSS, a free Japanese speech corpus for speech-based summarization and simplification. It contains recordings for two tasks, duration-constrained text-to-speech summarization and speaking-style simplification, plus utterances of long-form sentences as an optional task.

Contribution

The paper presents the creation and design of a novel Japanese speech corpus specifically tailored for summarization and simplification tasks in speech processing.

Findings

01

Corpus enables research on speech summarization and simplification.

02

Includes recordings for duration-constrained and style-simplified speech.

03

Also contains utterances of long-form sentences as an optional task.

Abstract

In this paper, we construct a new Japanese speech corpus for speech-based summarization and simplification, "JSSS" (pronounced "j-triple-s"). Given the success of reading-style speech synthesis from short-form sentences, we aim to design more difficult tasks for delivering information to humans. Our corpus contains voices recorded for two tasks that have a role in providing information under constraints: duration-constrained text-to-speech summarization and speaking-style simplification. It also contains utterances of long-form sentences as an optional task. This paper describes how we designed the corpus, which is available on our project page.

Code & Models

Repositories

tarepan/jsss (PyTorch)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification