An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale
Yong Liu, Pavel Dmitriev, Yifei Huang, Andrew Brooks, Li Dong

TL;DR
This study evaluates transfer learning methods for classifying sales engagement emails, highlighting the effectiveness of fine-tuned BERT with sufficient labeled data and comparing various embeddings and strategies.
Contribution
It introduces an evaluation framework for transfer learning in sales email classification, assessing embeddings, labeled data requirements, and implementation strategies.
Findings
Fine-tuned BERT outperforms other models with 300+ labeled samples.
Feature-based approaches perform better with fewer than 300 samples.
Scalability tested on GPU clusters with increasing data sizes.
Abstract
This paper conducts an empirical investigation to evaluate transfer learning for classifying sales engagement emails arising from digital sales engagement platforms. Given the complexity of content and context of sales engagement, lack of standardized large corpora and benchmarks, limited labeled examples and heterogenous context of intent, this real-world use case poses both a challenge and an opportunity for adopting a transfer learning approach. We propose an evaluation framework to assess a high performance transfer learning (HPTL) approach in three key areas in addition to commonly used accuracy metrics: 1) effective embeddings and pretrained language model usage, 2) minimum labeled samples requirement and 3) transfer learning implementation strategies. We use in-house sales engagement email samples as the experiment dataset, which includes over 3000 emails labeled as positive,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Personal Information Management and User Behavior · Mental Health via Writing
MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · ELMo · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay
