Small Language Models for Application Interactions: A Case Study

Beibin Li; Yi Zhang; Sébastien Bubeck; Jeevan Pathuri; Ishai Menache

arXiv:2405.20347 · cs.CL · June 3, 2024 · 2 cites

TL;DR

This case study finds that small language models can support natural-language interaction with an application and can outperform much larger models in both accuracy and running time, even when fine-tuned on small datasets. It also discusses design considerations for SLM-based systems.

Contribution

A case study of small language models applied to an internal Microsoft application for cloud supply chain fulfilment, together with a discussion of design considerations for SLM-based systems.

Findings

01 Small models can outperform much larger ones in both accuracy and running time.

02 Fine-tuning is effective even on small datasets.

03 The paper highlights design considerations for SLM-based systems.

Abstract

We study the efficacy of Small Language Models (SLMs) in facilitating application usage through natural language interactions. Our focus here is on a particular internal application used in Microsoft for cloud supply chain fulfilment. Our experiments show that small models can outperform much larger ones in terms of both accuracy and running time, even when fine-tuned on small datasets. Alongside these results, we also highlight SLM-based system design considerations.

Taxonomy

Topics: Service-Oriented Architecture and Web Services · Business Process Modeling and Analysis · Speech and dialogue systems
