Investigating Cross-Domain Behaviors of BERT in Review Understanding

Albert Lu; Meng Jiang

arXiv:2306.15123·cs.CL·June 29, 2023

Investigating Cross-Domain Behaviors of BERT in Review Understanding

Albert Lu, Meng Jiang

PDF

Open Access

TL;DR

This paper empirically studies how BERT models behave across different review domains, revealing that multi-domain fine-tuned models offer a good balance of performance and resource efficiency in review understanding tasks.

Contribution

It provides the first empirical analysis of cross-domain BERT behaviors in product review classification, comparing single-domain and multi-domain fine-tuning effects.

Findings

01

Multi-domain models outperform single-domain models on multi-domain data.

02

Single-domain models perform slightly better on their specific domain.

03

Multi-domain models reduce computational costs while maintaining competitive accuracy.

Abstract

Review score prediction requires review text understanding, a critical real-world application of natural language processing. Due to dissimilar text domains in product reviews, a common practice is fine-tuning BERT models upon reviews of differing domains. However, there has not yet been an empirical study of cross-domain behaviors of BERT models in the various tasks of product review understanding. In this project, we investigate text classification BERT models fine-tuned on single-domain and multi-domain Amazon review data. In our findings, though single-domain models achieved marginally improved performance on their corresponding domain compared to multi-domain models, multi-domain models outperformed single-domain models when evaluated on multi-domain data, single-domain data the single-domain model was not fine-tuned on, and on average when considering all tests. Though slight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Sentiment Analysis and Opinion Mining

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Attention Dropout · WordPiece · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Residual Connection