Classifying Textual Data with Pre-trained Vision Models through Transfer   Learning and Data Transformations

Charaf Eddine Benarab

arXiv:2106.12479·cs.CL·February 8, 2022·1 cites

Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations

Charaf Eddine Benarab

PDF

Open Access 1 Repo

TL;DR

This paper explores using pre-trained vision models to classify text by transforming BERT embeddings into images and applying transfer learning, achieving promising sentiment analysis results without extensive compute resources.

Contribution

It introduces a novel approach of converting BERT sentence embeddings into images and leveraging pre-trained vision models for text classification, bridging vision and language domains.

Findings

01

Effective sentiment analysis achieved with vision models on transformed text data.

02

Transforming BERT embeddings into images enables cross-domain transfer learning.

03

Method requires no additional heavy compute resources.

Abstract

Knowledge is acquired by humans through experience, and no boundary is set between the kinds of knowledge or skill levels we can achieve on different tasks at the same time. When it comes to Neural Networks, that is not the case. The breakthroughs in the field are extremely task and domain-specific. Vision and language are dealt with in separate manners, using separate methods and different datasets. Current text classification methods, mostly rely on obtaining contextual embeddings for input text samples, then training a classifier on the embedded dataset. Transfer learning in Language-related tasks in general, is heavily used in obtaining the contextual text embeddings for the input samples. In this work, we propose to use the knowledge acquired by benchmark Vision Models which are trained on ImageNet to help a much smaller architecture learn to classify text. A data transformation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EddCBen/Classifying-Textual-Data-with-pretrained-Vision-Models-through-Transfer-Learning-and-Data-Transforms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Convolution · Batch Normalization · Average Pooling · Global Average Pooling · 1x1 Convolution · Bottleneck Residual Block