Comparing BERT against traditional machine learning text classification
Santiago Gonz\'alez-Carvajal, Eduardo C. Garrido-Merch\'an

TL;DR
This paper empirically compares BERT with traditional machine learning methods like TF-IDF and classical NLP approaches, demonstrating BERT's superior performance and robustness across various NLP classification tasks.
Contribution
It provides empirical evidence supporting BERT as a default choice for NLP classification tasks over traditional methods.
Findings
BERT outperforms traditional TF-IDF-based classifiers.
BERT's performance is consistent across different languages.
Empirical support for adopting BERT as a standard NLP tool.
Abstract
The BERT model has arisen as a popular state-of-the-art machine learning model in the recent years that is able to cope with multiple NLP tasks such as supervised text classification without human supervision. Its flexibility to cope with any type of corpus delivering great results has make this approach very popular not only in academia but also in the industry. Although, there are lots of different approaches that have been used throughout the years with success. In this work, we first present BERT and include a little review on classical NLP approaches. Then, we empirically test with a suite of experiments dealing different scenarios the behaviour of BERT against the traditional TF-IDF vocabulary fed to machine learning algorithms. Our purpose of this work is to add empirical evidence to support or refuse the use of BERT as a default on NLP tasks. Experiments show the superiority of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsLinear Layer · Weight Decay · Softmax · Adam · Multi-Head Attention · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Linear Warmup With Linear Decay · Dense Connections
