PASC: Pipeline-Aware Conformal Prediction with Joint Coverage Guarantees for Multi-Stage NLP and LLM Pipelines
Varun Kotte

TL;DR
PASC introduces a pipeline-aware conformal prediction method that guarantees joint coverage across multiple NLP and LLM pipeline stages, improving reliability and efficiency over existing methods.
Contribution
It reduces multi-stage joint coverage to a single conformal prediction problem, providing finite-sample guarantees and better scalability for complex NLP pipelines.
Findings
Achieves 96.4% end-to-end coverage on a three-stage pipeline.
Maintains coverage under distribution shift where independent methods fail.
Runs 1.7x faster than Bonferroni correction and scales to more stages.
Abstract
Modern NLP and LLM systems are pipelines: named entity recognition (NER) -> entity disambiguation (NED) -> entity typing, retrieval-augmented generation (retriever -> reader), and agentic chains of planner -> tool -> critic. Errors compound across stages, but existing uncertainty quantification methods either calibrate each stage independently (no joint coverage) or apply a Bonferroni union bound (joint coverage, but conservative). We present PASC (Pipeline-Aware Split Conformal), which reduces multi-stage joint coverage to a single scalar conformal prediction problem on the joint maximum nonconformity score. PASC provides a finite-sample distribution-free guarantee that all K stages are simultaneously covered with probability at least 1 - alpha, and is nearly tight up to a 1/(n+1) factor. On a three-stage NER -> NED -> entity-typing pipeline over CoNLL-2003, PASC achieves 96.4%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
