LLMs for Test Input Generation for Semantic Caches

Zafaryab Rasool; Scott Barnett; David Willie; Stefanus Kurniawan,; Sherwin Balugo; Srikanth Thudumu; Mohamed Abdelrazek

arXiv:2401.08138·cs.SE·January 17, 2024·1 cites

LLMs for Test Input Generation for Semantic Caches

Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan,, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek

PDF

Open Access

TL;DR

This paper introduces VaryGen, a novel method leveraging LLMs to generate similar test queries from unstructured text, aiding the evaluation and calibration of semantic caches in large-scale semantic systems.

Contribution

VaryGen uses LLM reasoning to adapt, synthesize, and evaluate test queries, addressing the lack of labeled data for semantic cache testing.

Findings

01

Generated queries align with human expectations of similarity.

02

The approach reveals failure cases of semantic caches.

03

Evaluation on Qasper dataset demonstrates effectiveness.

Abstract

Large language models (LLMs) enable state-of-the-art semantic capabilities to be added to software systems such as semantic search of unstructured documents and text generation. However, these models are computationally expensive. At scale, the cost of serving thousands of users increases massively affecting also user experience. To address this problem, semantic caches are used to check for answers to similar queries (that may have been phrased differently) without hitting the LLM service. Due to the nature of these semantic cache techniques that rely on query embeddings, there is a high chance of errors impacting user confidence in the system. Adopting semantic cache techniques usually requires testing the effectiveness of a semantic cache (accurate cache hits and misses) which requires a labelled test set of similar queries and responses which is often unavailable. In this paper, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

Methodstravel james · Sparse Evolutionary Training