Deep Learning Based Natural Language Processing for End to End Speech   Translation

Sarvesh Patil

arXiv:1808.04459·cs.CL·August 15, 2018

Deep Learning Based Natural Language Processing for End to End Speech Translation

Sarvesh Patil

PDF

Open Access

TL;DR

This paper reviews deep learning techniques for end-to-end speech translation, focusing on signal processing and deep recurrent neural networks to improve speech-to-text systems in NLP.

Contribution

It presents an overview of applying deep learning and signal processing methods to develop efficient speech-to-text translation systems.

Findings

01

Deep learning enhances speech translation accuracy.

02

Recurrent neural networks effectively model sequential speech data.

03

Signal processing techniques improve system robustness.

Abstract

Deep Learning methods employ multiple processing layers to learn hierarchial representations of data. They have already been deployed in a humongous number of applications and have produced state-of-the-art results. Recently with the growth in processing power of computers to be able to do high dimensional tensor calculations, Natural Language Processing (NLP) applications have been given a significant boost in terms of efficiency as well as accuracy. In this paper, we will take a look at various signal processing techniques and then application of them to produce a speech-to-text system using Deep Recurrent Neural Networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling