Protoformer: Embedding Prototypes for Transformers

Ashkan Farhangi; Ning Sui; Nan Hua; Haiyan Bai; Arthur Huang; Zhishan Guo

arXiv:2206.12710 · cs.CL · June 28, 2022

TL;DR

Protoformer is a self-learning framework that enhances transformer-based text classification by leveraging anomalies and difficult samples through prototype embedding, improving performance across diverse datasets.

Contribution

It introduces a novel prototype embedding mechanism for Transformers that utilizes problematic samples to boost classification accuracy.

Findings

01 Improves transformer performance on noisy and anomaly-rich datasets

02 Effectively leverages problematic samples for better classification

03 Demonstrates robustness across diverse textual datasets

Abstract

Transformers have been widely applied in text classification. Unfortunately, real-world data contain anomalies and noisy labels that cause challenges for state-of-the-art Transformers. This paper proposes Protoformer, a novel self-learning framework for Transformers that can leverage problematic samples for text classification. Protoformer features a selection mechanism for embedding samples that allows us to efficiently extract and utilize anomaly prototypes and difficult class prototypes. We demonstrated such capabilities on datasets with diverse textual structures (e.g., Twitter, IMDB, ArXiv). We also applied the framework to several models. The results indicate that Protoformer can improve current Transformers in various empirical settings.
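
The abstract does not spell out the selection rule, so the following is only an illustrative sketch of how prototypes might be picked from sample embeddings, not the paper's algorithm. It assumes two hypothetical criteria: an "anomaly" candidate is the sample farthest from its own class centroid, and a "difficult" candidate is the sample with the smallest margin between its nearest foreign centroid and its own centroid. The function name `select_prototypes`, the parameter `k`, and the toy embeddings are all made up for this example.

```python
import math
from collections import defaultdict


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def distance(a, b):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_prototypes(embeddings, labels, k=1):
    """Return (anomaly, difficult): dicts mapping class -> list of sample indices.

    Assumed criteria (hypothetical, not taken from the paper):
      anomaly   = k samples farthest from their own class centroid
      difficult = k samples with the smallest margin
                  (distance to nearest other centroid - distance to own centroid)
    Requires at least two classes.
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    centroids = {c: centroid([embeddings[i] for i in idxs])
                 for c, idxs in by_class.items()}

    anomaly, difficult = {}, {}
    for c, idxs in by_class.items():
        own = {i: distance(embeddings[i], centroids[c]) for i in idxs}

        def margin(i):
            nearest_other = min(distance(embeddings[i], centroids[o])
                                for o in centroids if o != c)
            return nearest_other - own[i]

        anomaly[c] = sorted(idxs, key=lambda i: own[i], reverse=True)[:k]
        difficult[c] = sorted(idxs, key=margin)[:k]
    return anomaly, difficult


# Toy 2-D "embeddings"; in practice these would come from a Transformer encoder.
embeddings = [[0, 0], [0, 1], [0, -4], [2, 0],
              [10, 0], [10, 1], [10, -1], [8, 0]]
labels = ["a", "a", "a", "a", "b", "b", "b", "b"]
anomaly, difficult = select_prototypes(embeddings, labels, k=1)
print(anomaly, difficult)
```

In the toy data, class "a" has one point that sits far from the rest of its class and another that leans toward class "b", so the two criteria can pick different samples. A real pipeline would substitute learned embeddings and whatever selection rule the paper actually defines.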

Taxonomy

Methods: Self-Learning