TopicBERT for Energy Efficient Document Classification

Yatin Chaudhary; Pankaj Gupta; Khushbu Saxena; Vivek Kulkarni; Thomas; Runkler; Hinrich Sch\"utze

arXiv:2010.16407·cs.CL·November 2, 2020

TopicBERT for Energy Efficient Document Classification

Yatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas, Runkler, Hinrich Sch\"utze

PDF

1 Repo

TL;DR

TopicBERT is a unified model that reduces computational costs and carbon emissions in document classification by jointly learning topic and language models, achieving significant speedups with minimal performance loss.

Contribution

It introduces a novel framework that combines topic and language modeling to optimize fine-tuning efficiency for long document classification tasks.

Findings

01

1. Achieves 1.4x speedup in fine-tuning.

02

2. Reduces CO2 emissions by approximately 40%.

03

3. Maintains 99.9% performance across multiple datasets.

Abstract

Prior research notes that BERT's computational cost grows quadratically with sequence length thus leading to longer training times, higher GPU memory constraints and carbon emissions. While recent work seeks to address these scalability issues at pre-training, these issues are also prominent in fine-tuning especially for long sequence tasks like document classification. Our work thus focuses on optimizing the computational cost of fine-tuning for document classification. We achieve this by complementary learning of both topic and language models in a unified framework, named TopicBERT. This significantly reduces the number of self-attention operations - a main performance bottleneck. Consequently, our model achieves a 1.4x ( $\sim 40%$ ) speedup with $\sim 40%$ reduction in $C O_{2}$ emission while retaining $99.9%$ performance over 5 datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YatinChaudhary/TopicBERT
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.