Exploring Dimensionality Reduction Techniques in Multilingual Transformers
\'Alvaro Huertas-Garc\'ia, Alejandro Mart\'in, Javier Huertas-Tato,, David Camacho

TL;DR
This paper investigates how various dimensionality reduction techniques affect the performance of multilingual Siamese Transformers on semantic tasks, demonstrating significant dimension reduction with minimal performance loss.
Contribution
It provides a comprehensive analysis of multiple dimensional reduction methods on multilingual transformers, highlighting their impact on semantic task performance and visualization.
Findings
Achieved up to 91.58% dimension reduction with minimal performance degradation.
Dimensionality reduction techniques can effectively simplify high-dimensional embeddings.
Results inform better model tuning and visualization strategies for NLP tasks.
Abstract
Both in scientific literature and in industry,, Semantic and context-aware Natural Language Processing-based solutions have been gaining importance in recent years. The possibilities and performance shown by these models when dealing with complex Language Understanding tasks is unquestionable, from conversational agents to the fight against disinformation in social networks. In addition, considerable attention is also being paid to developing multilingual models to tackle the language bottleneck. The growing need to provide more complex models implementing all these features has been accompanied by an increase in their size, without being conservative in the number of dimensions required. This paper aims to give a comprehensive account of the impact of a wide variety of dimensional reduction techniques on the performance of different state-of-the-art multilingual Siamese Transformers,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Natural Language Processing Techniques
