Improving LLM Reliability through Hybrid Abstention and Adaptive Detection
Ankit Sharma, Nachiket Tapas, Jyotiprakash Patra

TL;DR
This paper presents a dynamic, context-aware abstention system for LLMs that improves safety, reduces latency, and maintains high utility by integrating multiple detectors in a hierarchical cascade.
Contribution
It introduces an adaptive abstention framework with a multi-detector cascade that dynamically balances safety and utility in LLM deployment.
Findings
Reduces false positives in sensitive domains
Achieves lower latency compared to static guardrails
Maintains high safety recall and precision
Abstract
Large Language Models (LLMs) deployed in production environments face a fundamental safety-utility trade-off either a strict filtering mechanisms prevent harmful outputs but often block benign queries or a relaxed controls risk unsafe content generation. Conventional guardrails based on static rules or fixed confidence thresholds are typically context-insensitive and computationally expensive, resulting in high latency and degraded user experience. To address these limitations, we introduce an adaptive abstention system that dynamically adjusts safety thresholds based on real-time contextual signals such as domain and user history. The proposed framework integrates a multi-dimensional detection architecture composed of five parallel detectors, combined through a hierarchical cascade mechanism to optimize both speed and precision. The cascade design reduces unnecessary computation by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education
