STaR-DRO: Stateful Tsallis Reweighting for Group-Robust Structured Prediction
Samah Fodeh, Ganesh Puthiaraju, Elyas Irankhah, Linhai Ma, Srivani Talakokkul, Afshan Khan, Sreeraj Ramachandran, Jordan Alpert, Sarah Schellhorn

TL;DR
This paper introduces a novel framework combining controllable inference and a robust optimization method, STaR-DRO, to improve structured prediction accuracy and reliability in heterogeneous group settings, especially in clinical text analysis.
Contribution
It proposes a new prompting strategy for structured generation and a stateful group-robust optimization method, STaR-DRO, for better handling group heterogeneity in structured prediction tasks.
Findings
Prompt engineering improves zero-shot F1 by +15.44 on EPPC Miner.
STaR-DRO increases Code F1 from 79.24 to 81.47 on Llama-3.3-70B-Instruct.
Reduces group-wise validation cross-entropy by up to 29.6% on difficult clinical categories.
Abstract
Structured prediction requires models to generate ontology-constrained labels, grounded evidence, and valid structure under ambiguity, label skew, and heterogeneous group difficulty. We present a two-part framework for controllable inference and robust fine-tuning. First, we introduce a task-agnostic prompting strategy that combines XML-based instruction structure, disambiguation rules, verification-style reasoning, schema constraints, and self-validation to address format drift, label ambiguity, evidence hallucination, and metadata-conditioned confusion in in-context structured generation. Second, we introduce STaR-DRO, a stateful robust optimization method for group heterogeneity. It combines Tsallis mirror descent with momentum-smoothed, centered group-loss signals and bounded excess-only multipliers so that only persistently hard groups above a neutral baseline are upweighted,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
