Advancing Text-to-GLOSS Neural Translation Using a Novel Hyper-parameter Optimization Technique
Younes Ouargani, Noussaima El Khattabi

TL;DR
This paper introduces a novel hyper-parameter optimization method to enhance transformer-based neural machine translation for text-to-GLOSS, significantly improving translation accuracy for low-resource deaf communication data.
Contribution
The study presents a new hyper-parameter exploration technique that tailors transformer architectures for low-resource text-to-GLOSS translation, achieving state-of-the-art results.
Findings
Optimal transformer architecture outperforms previous models.
Achieved ROUGE score of 55.18% and BLEU-1 score of 63.6%.
Outperforms previous results on the PHOENIX14T dataset.
Abstract
In this paper, we investigate the use of transformers for Neural Machine Translation of text-to-GLOSS for Deaf and Hard-of-Hearing communication. Due to the scarcity of available data and limited resources for text-to-GLOSS translation, we treat the problem as a low-resource language task. We use our novel hyper-parameter exploration technique to explore a variety of architectural parameters and build an optimal transformer-based architecture specifically tailored for text-to-GLOSS translation. The study aims to improve the accuracy and fluency of Neural Machine Translation generated GLOSS. This is achieved by examining various architectural parameters including layer count, attention heads, embedding dimension, dropout, and label smoothing to identify the optimal architecture for improving text-to-GLOSS translation performance. The experiments conducted on the PHOENIX14T dataset reveal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Subtitles and Audiovisual Media · Natural Language Processing Techniques
MethodsLabel Smoothing
