Approximation Algorithms for Cascading Prediction Models

Matthew Streeter

arXiv:1802.07697·cs.LG·February 22, 2018

Approximation Algorithms for Cascading Prediction Models

Matthew Streeter

PDF

Open Access

TL;DR

This paper introduces an approximation algorithm that creates cascaded prediction models from pre-trained models, significantly reducing computational cost and memory I/O while maintaining accuracy, demonstrated on ImageNet classification.

Contribution

It presents a novel approximation algorithm for constructing cost-efficient cascaded models from existing pre-trained models, optimizing for accuracy and computational efficiency.

Findings

01

Up to 2x reduction in floating point multiplications.

02

Up to 6x reduction in average-case memory I/O.

03

Cascades adapt input resolution and confidence thresholds based on image difficulty.

Abstract

We present an approximation algorithm that takes a pool of pre-trained models as input and produces from it a cascaded model with similar accuracy but lower average-case cost. Applied to state-of-the-art ImageNet classification models, this yields up to a 2x reduction in floating point multiplications, and up to a 6x reduction in average-case memory I/O. The auto-generated cascades exhibit intuitive properties, such as using lower-resolution input for easier images and requiring higher prediction confidence when using a computationally cheaper model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms