Scaling Laws for Discriminative Classification in Large Language Models

Dean Wyatte; Fatemeh Tahmasbi; Ming Li; Thomas Markovich

arXiv:2405.15765·cs.CL·May 27, 2024

Scaling Laws for Discriminative Classification in Large Language Models

Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

PDF

Open Access

TL;DR

This paper explores how scaling laws influence discriminative classification performance in large language models, demonstrating improved accuracy and efficiency for customer support tasks through model size adjustments.

Contribution

It introduces a novel approach to framing LLMs as discriminative classifiers for customer support, with empirical scaling curves and analysis of trade-offs.

Findings

01

Scaling curves for validation loss and top-K accuracy are established.

02

Offline and online experiments show significant performance improvements.

03

Trade-offs between model size, latency, and accuracy are discussed.

Abstract

Modern large language models (LLMs) represent a paradigm shift in what can plausibly be expected of machine learning models. The fact that LLMs can effectively generate sensible answers to a diverse range of queries suggests that they would be useful in customer support applications. While powerful, LLMs have been observed to be prone to hallucination which unfortunately makes their near term use in customer support applications challenging. To address this issue we present a system that allows us to use an LLM to augment our customer support advocates by re-framing the language modeling task as a discriminative classification task. In this framing, we seek to present the top-K best template responses for a customer support advocate to use when responding to a customer. We present the result of both offline and online experiments where we observed offline gains and statistically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsIs Venmo Customer Support Available 24/7? How to Reach a Real Person