Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective
Jinming Xing, Dongwen Luo, Chang Xue, Ruilin Xing

TL;DR
This paper compares different pooling mechanisms in Transformer-based LLMs like BERT and GPT, analyzing their impact on sentiment analysis performance and highlighting the importance of choosing appropriate pooling strategies for specific tasks.
Contribution
It provides a comprehensive comparison of pooling methods in LLMs for sentiment analysis, revealing their strengths and weaknesses across different architectures.
Findings
Max pooling performs best for sentiment classification in BERT.
Weighted sum pooling offers flexibility but is less consistent.
Pooling choice significantly affects model performance depending on task requirements.
Abstract
Large Language Models (LLMs) have revolutionized natural language processing (NLP) by delivering state-of-the-art performance across a variety of tasks. Among these, Transformer-based models like BERT and GPT rely on pooling layers to aggregate token-level embeddings into sentence-level representations. Common pooling mechanisms such as Mean, Max, and Weighted Sum play a pivotal role in this aggregation process. Despite their widespread use, the comparative performance of these strategies on different LLM architectures remains underexplored. To address this gap, this paper investigates the effects of these pooling mechanisms on two prominent LLM families -- BERT and GPT, in the context of sentence-level sentiment analysis. Comprehensive experiments reveal that each pooling mechanism exhibits unique strengths and weaknesses depending on the task's specific requirements. Our findings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinTech, Crowdfunding, Digital Finance · Collaboration in agile enterprises · ERP Systems Implementation and Impact
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Discriminative Fine-Tuning · Cosine Annealing · Linear Layer · Byte Pair Encoding · Adam · Residual Connection · Weight Decay · Softmax
