STaR-DRO: Stateful Tsallis Reweighting for Group-Robust Structured Prediction

Samah Fodeh; Ganesh Puthiaraju; Elyas Irankhah; Linhai Ma; Srivani Talakokkul; Afshan Khan; Sreeraj Ramachandran; Jordan Alpert; Sarah Schellhorn

arXiv:2604.09737·cs.LG·April 14, 2026

STaR-DRO: Stateful Tsallis Reweighting for Group-Robust Structured Prediction

Samah Fodeh, Ganesh Puthiaraju, Elyas Irankhah, Linhai Ma, Srivani Talakokkul, Afshan Khan, Sreeraj Ramachandran, Jordan Alpert, Sarah Schellhorn

PDF

TL;DR

This paper introduces a novel framework combining controllable inference and a robust optimization method, STaR-DRO, to improve structured prediction accuracy and reliability in heterogeneous group settings, especially in clinical text analysis.

Contribution

It proposes a new prompting strategy for structured generation and a stateful group-robust optimization method, STaR-DRO, for better handling group heterogeneity in structured prediction tasks.

Findings

01

Prompt engineering improves zero-shot F1 by +15.44 on EPPC Miner.

02

STaR-DRO increases Code F1 from 79.24 to 81.47 on Llama-3.3-70B-Instruct.

03

Reduces group-wise validation cross-entropy by up to 29.6% on difficult clinical categories.

Abstract

Structured prediction requires models to generate ontology-constrained labels, grounded evidence, and valid structure under ambiguity, label skew, and heterogeneous group difficulty. We present a two-part framework for controllable inference and robust fine-tuning. First, we introduce a task-agnostic prompting strategy that combines XML-based instruction structure, disambiguation rules, verification-style reasoning, schema constraints, and self-validation to address format drift, label ambiguity, evidence hallucination, and metadata-conditioned confusion in in-context structured generation. Second, we introduce STaR-DRO, a stateful robust optimization method for group heterogeneity. It combines Tsallis mirror descent with momentum-smoothed, centered group-loss signals and bounded excess-only multipliers so that only persistently hard groups above a neutral baseline are upweighted,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.