Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms
Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang, Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

TL;DR
This paper demonstrates that simple, parameter-free pooling models based on word embeddings can perform as well as or better than complex neural architectures across various NLP tasks, challenging assumptions about the necessity of sophisticated compositional functions.
Contribution
The paper provides a comprehensive comparison showing the effectiveness of simple pooling methods over complex models and introduces two novel pooling strategies for improved interpretability and spatial information retention.
Findings
SWEMs perform comparably or better than RNN/CNN models on most datasets.
Max-pooling enhances interpretability of word embeddings.
Hierarchical pooling preserves n-gram information within text sequences.
Abstract
Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWEMs), consisting of parameter-free pooling operations, relative to word-embedding-based RNN/CNN models. Surprisingly, SWEMs exhibit comparable or even superior performance in the majority of cases considered. Based upon this understanding, we propose two additional pooling strategies over learned word embeddings: (i) a max-pooling operation for improved interpretability; and (ii) a hierarchical pooling operation, which preserves spatial (n-gram) information within text sequences. We present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
