PromptPort: A Reliability Layer for Cross-Model Structured Extraction

Varun Kotte

arXiv:2601.06151·cs.LG·January 13, 2026

PromptPort: A Reliability Layer for Cross-Model Structured Extraction

Varun Kotte

PDF

Open Access

TL;DR

PromptPort enhances the reliability of structured extraction from LLMs by addressing format collapse issues through canonicalization, verification, and abstention, significantly improving operational robustness without altering base models.

Contribution

The paper introduces PromptPort, a novel reliability layer that mitigates format collapse and cross-model variability in structured extraction tasks.

Findings

01

Severe format collapse observed across models (ROS 0.116 vs CSS 0.246).

02

PromptPort recovers 6-8 F1 points through canonicalization and verification.

03

Approaches oracle-level performance (0.890 vs 0.896 F1) without modifying base models.

Abstract

Structured extraction with LLMs fails in production not because models lack understanding, but because output formatting is unreliable across models and prompts. A prompt that returns clean JSON on GPT-4 may produce fenced, prose-wrapped, or malformed output on Llama, causing strict parsers to reject otherwise correct extractions. We formalize this as format collapse and introduce a dual-metric evaluation framework: ROS (strict parsing, measuring operational reliability) and CSS (post-canonicalization, measuring semantic capability). On a 37,346-example camera metadata benchmark across six model families, we find severe format collapse (for example, Gemma-2B: ROS 0.116 versus CSS 0.246) and large cross-model portability gaps (0.4 to 0.6 F1). We then present PromptPort, a reliability layer combining deterministic canonicalization with a lightweight verifier (DistilBERT) and a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Digital Humanities and Scholarship · Scientific Computing and Data Management