RvLLM: LLM Runtime Verification with Domain Knowledge
Yedi Zhang, Sun Yi Emma, Annabelle Lee Jia En, Jin Song Dong

TL;DR
This paper introduces RvLLM, a runtime verification framework that uses a new specification language, ESL, to incorporate domain knowledge for detecting errors in LLM outputs across various tasks.
Contribution
The work presents a novel ESL specification language and a runtime verification framework, RvLLM, enabling domain experts to customize constraints for verifying LLM outputs in real-time.
Findings
RvLLM effectively detects erroneous outputs in multiple LLMs.
The framework is lightweight and flexible for various domain-specific tasks.
Experimental results show improved reliability of LLM outputs with RvLLM.
Abstract
Large language models (LLMs) have emerged as a dominant AI paradigm due to their exceptional text understanding and generation capabilities. However, their tendency to generate inconsistent or erroneous outputs challenges their reliability, especially in high-stakes domains requiring accuracy and trustworthiness. Existing research primarily focuses on detecting and mitigating model misbehavior in general-purpose scenarios, often overlooking the potential of integrating domain-specific knowledge. In this work, we advance misbehavior detection by incorporating domain knowledge. The core idea is to design a general specification language that enables domain experts to customize domain-specific predicates in a lightweight and intuitive manner, supporting later runtime verification of LLM outputs. To achieve this, we design a novel specification language, ESL, and introduce a runtime…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Mathematics, Computing, and Information Processing
