Structured Prediction Cascades
David Weiss, Benjamin Sapp, Ben Taskar

TL;DR
Structured prediction cascades are a sequence of models that efficiently filter output spaces, enabling faster inference and learning while maintaining high accuracy in complex structured prediction tasks.
Contribution
We introduce a novel cascade architecture for structured prediction that optimizes a convex loss to balance filtering efficiency and accuracy, extending to intractable models with theoretical guarantees.
Findings
Achieved state-of-the-art results in handwriting and human pose recognition.
Enabled significant speedups and use of complex features in structured prediction.
Provided theoretical bounds for accuracy and efficiency of the cascade approach.
Abstract
Structured prediction tasks pose a fundamental trade-off between the need for model complexity to increase predictive power and the limited computational resources for inference in the exponentially-sized output spaces such models require. We formulate and develop the Structured Prediction Cascade architecture: a sequence of increasingly complex models that progressively filter the space of possible outputs. The key principle of our approach is that each model in the cascade is optimized to accurately filter and refine the structured output state space of the next model, speeding up both learning and inference in the next layer of the cascade. We learn cascades by optimizing a novel convex loss function that controls the trade-off between the filtering efficiency and the accuracy of the cascade, and provide generalization bounds for both accuracy and efficiency. We also extend our…
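To make the filtering step concrete, the following is a minimal sketch of how one cascade level might prune states in a linear-chain model. It is an illustration under stated assumptions, not the authors' implementation: the function names, the chain structure, the use of max-marginals as the pruning score, and the threshold rule (a convex combination of the best score and the mean max-marginal, controlled by a hypothetical parameter `alpha`) are all choices made here and are not specified in the text above. Learning with the convex loss is not shown.

```python
import numpy as np

def max_marginals(unary, pairwise):
    """Max-marginals for a linear chain.

    unary:    (T, K) per-position state scores (assumed input format).
    pairwise: (K, K) transition scores, pairwise[i, j] for i -> j.
    Returns a (T, K) array whose entry [t, k] is the score of the best
    full sequence constrained to take state k at position t.
    """
    T, K = unary.shape
    fwd = np.zeros((T, K))  # best prefix score ending in state k at t
    bwd = np.zeros((T, K))  # best suffix score after state k at t
    fwd[0] = unary[0]
    for t in range(1, T):
        fwd[t] = unary[t] + np.max(fwd[t - 1][:, None] + pairwise, axis=0)
    for t in range(T - 2, -1, -1):
        bwd[t] = np.max(pairwise + (unary[t + 1] + bwd[t + 1])[None, :], axis=1)
    return fwd + bwd

def filter_states(unary, pairwise, alpha=0.5):
    """Boolean (T, K) mask of states that survive pruning.

    Assumed threshold rule: alpha * best score + (1 - alpha) * mean
    max-marginal. For alpha in [0, 1] the threshold never exceeds the
    best score, so the highest-scoring sequence is never pruned.
    """
    mm = max_marginals(unary, pairwise)
    threshold = alpha * mm.max() + (1 - alpha) * mm.mean()
    return mm >= threshold

# Toy example: 3 positions, 2 states.
unary = np.array([[1.0, 0.0], [0.0, 2.0], [3.0, 0.0]])
pairwise = np.array([[0.0, -1.0], [-1.0, 0.0]])
keep = filter_states(unary, pairwise, alpha=0.5)
```

In this toy example only state 1 at the last position is pruned; a more complex model at the next level would then run inference over the surviving states only. Larger values of `alpha` prune more aggressively, which is one way to picture the trade-off between filtering efficiency and accuracy that the abstract describes.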
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Machine Learning and Algorithms
