Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Charaf Eddine Benarab

TL;DR
This paper explores using pre-trained vision models to classify text by transforming BERT embeddings into images and applying transfer learning, achieving promising sentiment analysis results without extensive compute resources.
Contribution
It introduces a novel approach of converting BERT sentence embeddings into images and leveraging pre-trained vision models for text classification, bridging vision and language domains.
Findings
Effective sentiment analysis achieved with vision models on transformed text data.
Transforming BERT embeddings into images enables cross-domain transfer learning.
Method requires no additional heavy compute resources.
Abstract
Knowledge is acquired by humans through experience, and no boundary is set between the kinds of knowledge or skill levels we can achieve on different tasks at the same time. When it comes to Neural Networks, that is not the case. The breakthroughs in the field are extremely task and domain-specific. Vision and language are dealt with in separate manners, using separate methods and different datasets. Current text classification methods, mostly rely on obtaining contextual embeddings for input text samples, then training a classifier on the embedded dataset. Transfer learning in Language-related tasks in general, is heavily used in obtaining the contextual text embeddings for the input samples. In this work, we propose to use the knowledge acquired by benchmark Vision Models which are trained on ImageNet to help a much smaller architecture learn to classify text. A data transformation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Convolution · Batch Normalization · Average Pooling · Global Average Pooling · 1x1 Convolution · Bottleneck Residual Block
