LLMs for Test Input Generation for Semantic Caches
Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan,, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek

TL;DR
This paper introduces VaryGen, a novel method leveraging LLMs to generate similar test queries from unstructured text, aiding the evaluation and calibration of semantic caches in large-scale semantic systems.
Contribution
VaryGen uses LLM reasoning to adapt, synthesize, and evaluate test queries, addressing the lack of labeled data for semantic cache testing.
Findings
Generated queries align with human expectations of similarity.
The approach reveals failure cases of semantic caches.
Evaluation on Qasper dataset demonstrates effectiveness.
Abstract
Large language models (LLMs) enable state-of-the-art semantic capabilities to be added to software systems such as semantic search of unstructured documents and text generation. However, these models are computationally expensive. At scale, the cost of serving thousands of users increases massively affecting also user experience. To address this problem, semantic caches are used to check for answers to similar queries (that may have been phrased differently) without hitting the LLM service. Due to the nature of these semantic cache techniques that rely on query embeddings, there is a high chance of errors impacting user confidence in the system. Adopting semantic cache techniques usually requires testing the effectiveness of a semantic cache (accurate cache hits and misses) which requires a labelled test set of similar queries and responses which is often unavailable. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
Methodstravel james · Sparse Evolutionary Training
