Real-Time Statistical Speech Translation

Krzysztof Wo{\l}k; Krzysztof Marasek

arXiv:1509.09090·cs.CL·October 1, 2015

Real-Time Statistical Speech Translation

Krzysztof Wo{\l}k, Krzysztof Marasek

PDF

Open Access

TL;DR

This paper explores real-time statistical machine translation for speech, integrating speech recognition and synthesis, and evaluates various data preparation techniques and metrics for the PL-EN language pair.

Contribution

It presents a comprehensive approach to real-time speech translation using statistical models and evaluates multiple data processing methods and metrics.

Findings

01

Data preparation significantly affects translation quality.

02

BLEU, NIST, METEOR, and TER metrics vary in effectiveness.

03

Optimal data and metric choices improve real-time translation performance.

Abstract

This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice communication system between foreigners. We obtained three main data sets from spoken proceedings that represent three different types of human speech. TED, Europarl, and OPUS parallel text corpora were used as the basis for training of language models, for developmental tuning and testing of the translation system. We also conducted experiments involving part of speech tagging, compound splitting, linear language model interpolation, TrueCasing and morphosyntactic analysis. We evaluated the effects of variety of data preparations on the translation results using the BLEU, NIST, METEOR and TER metrics and tried to give answer which metric is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis