A Survey of Voice Translation Methodologies - Acoustic Dialect Decoder
Hans Krupakar, Keerthika Rajvel, Bharathi B, Angel Deborah S, Vallidevi Krishnamurthy

TL;DR
This paper presents the Acoustic Dialect Decoder, a voice-to-voice translation device that integrates speech recognition, translation, and synthesis to enable real-time language translation, initially from English to Tamil.
Contribution
It introduces a comprehensive survey of recent speech engineering advances applied to a real-time voice translation device, combining recognition, translation, and synthesis components.
Findings
Developed a prototype English-to-Tamil translation device
Integrated HMM and RNN-based recognition and synthesis modules
Achieved near real-time translation with continuous speech input
Abstract
Speech translation has always been about giving source text or audio input and waiting for the system to give translated output in the desired form. In this paper, we present the Acoustic Dialect Decoder (ADD), a voice-to-voice ear-piece translation device. We introduce and survey the recent advances made in the field of speech engineering to employ in the ADD, focusing particularly on the three major processing steps of recognition, translation, and synthesis. We tackle the problem of machine understanding of natural language by designing a recognition unit for source audio to text, a translation unit for source language text to target language text, and a synthesis unit for target language text to target language speech. Speech from the surroundings will be recorded by the recognition unit present on the ear-piece, and translation will start as soon as one sentence is successfully read. This…
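The three-stage flow described in the abstract (recognition, then translation, then synthesis, with translation triggered once a full sentence has been read) can be sketched roughly as follows. This is only an illustrative outline under stated assumptions, not the authors' implementation: the class `VoicePipeline`, the stage type aliases, the punctuation-based sentence boundary, and the toy stand-in stages are all hypothetical and were chosen so the example runs without any speech models.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List

# Hypothetical stage signatures; the paper's HMM/RNN models are not reproduced here.
Recognizer = Callable[[bytes], str]    # source audio chunk -> source text fragment
Translator = Callable[[str], str]      # source sentence -> target sentence
Synthesizer = Callable[[str], bytes]   # target sentence -> target audio

# Assumption: the recognizer emits sentence-final punctuation.
SENTENCE_END = (".", "?", "!")


@dataclass
class VoicePipeline:
    recognize: Recognizer
    translate: Translator
    synthesize: Synthesizer

    def run(self, audio_chunks: Iterable[bytes]) -> Iterator[bytes]:
        """Yield one synthesized audio segment per completed sentence."""
        buffer: List[str] = []
        for chunk in audio_chunks:
            fragment = self.recognize(chunk).strip()
            if not fragment:
                continue  # nothing recognized in this chunk
            buffer.append(fragment)
            # Translation starts as soon as a full sentence has been read.
            if fragment.endswith(SENTENCE_END):
                sentence = " ".join(buffer)
                buffer.clear()
                yield self.synthesize(self.translate(sentence))
        # Flush any trailing words that never reached a sentence boundary.
        if buffer:
            yield self.synthesize(self.translate(" ".join(buffer)))


# Toy stand-ins so the sketch runs end to end without real models.
def toy_recognize(chunk: bytes) -> str:
    return chunk.decode("utf-8")      # pretend the audio bytes are the transcript


def toy_translate(sentence: str) -> str:
    return sentence.upper()           # placeholder for English-to-Tamil translation


def toy_synthesize(sentence: str) -> bytes:
    return sentence.encode("utf-8")   # pretend the text bytes are the waveform
```

Because `run` is a generator, each translated sentence becomes available as soon as its boundary is detected instead of after the whole recording, which is the behaviour the abstract attributes to the device. Real recognition, translation, and synthesis models would be substituted for the three toy functions.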
