Loading paper
STED and Consistency Scoring: A Framework for Evaluating LLM Structured Output Reliability | Tomesphere