Adversarial Structured Prediction for Multivariate Measures

Hong Wang; Ashkan Rezaei; Brian D. Ziebart

arXiv:1712.07374·stat.ML·December 22, 2017

Adversarial Structured Prediction for Multivariate Measures

Hong Wang, Ashkan Rezaei, Brian D. Ziebart

PDF

Open Access

TL;DR

This paper introduces an adversarial training framework for structured prediction that directly optimizes multivariate performance measures like F-score and AER, addressing the limitations of surrogate loss methods.

Contribution

It proposes a novel adversarial approach that approximates training data while directly optimizing the true multivariate evaluation metrics for structured prediction tasks.

Findings

01

Effective for word alignment with AER

02

Improves named entity recognition performance

03

Addresses mismatch issues in surrogate loss methods

Abstract

Many predicted structured objects (e.g., sequences, matchings, trees) are evaluated using the F-score, alignment error rate (AER), or other multivariate performance measures. Since inductively optimizing these measures using training data is typically computationally difficult, empirical risk minimization of surrogate losses is employed, using, e.g., the hinge loss for (structured) support vector machines. These approximations often introduce a mismatch between the learner's objective and the desired application performance, leading to inconsistency. We take a different approach: adversarially approximate training data while optimizing the exact F-score or AER. Structured predictions under this formulation result from solving zero-sum games between a predictor seeking the best performance and an adversary seeking the worst while required to (approximately) match certain structured…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning