A Survey of Voice Translation Methodologies - Acoustic Dialect Decoder

Hans Krupakar; Keerthika Rajvel; Bharathi B; Angel Deborah S,; Vallidevi Krishnamurthy

arXiv:1610.03934·cs.CL·December 1, 2016

A Survey of Voice Translation Methodologies - Acoustic Dialect Decoder

Hans Krupakar, Keerthika Rajvel, Bharathi B, Angel Deborah S,, Vallidevi Krishnamurthy

PDF

TL;DR

This paper presents the Acoustic Dialect Decoder, a voice-to-voice translation device that integrates speech recognition, translation, and synthesis to enable real-time language translation, initially from English to Tamil.

Contribution

It introduces a comprehensive survey of recent speech engineering advances applied to a real-time voice translation device, combining recognition, translation, and synthesis components.

Findings

01

Developed a prototype English to Tamil translation device

02

Integrated HMM and RNN-based recognition and synthesis modules

03

Achieved near real-time translation with continuous speech input

Abstract

Speech Translation has always been about giving source text or audio input and waiting for system to give translated output in desired form. In this paper, we present the Acoustic Dialect Decoder (ADD) - a voice to voice ear-piece translation device. We introduce and survey the recent advances made in the field of Speech Engineering, to employ in the ADD, particularly focusing on the three major processing steps of Recognition, Translation and Synthesis. We tackle the problem of machine understanding of natural language by designing a recognition unit for source audio to text, a translation unit for source language text to target language text, and a synthesis unit for target language text to target language speech. Speech from the surroundings will be recorded by the recognition unit present on the ear-piece and translation will start as soon as one sentence is successfully read. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.