EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference

Sumaiya Tabassum Nimi; Md Adnan Arefeen; Md Yusuf Sarwar Uddin; Yugyung Lee

arXiv:2106.13842·cs.CV·June 30, 2021

TL;DR

EARLIN is a lightweight out-of-distribution (OOD) detection method built on a pretrained model. It improves resource efficiency in collaborative inference by flagging OOD inputs early on the edge device, so they are not uploaded to the cloud unnecessarily.

Contribution

It introduces a novel OOD detection approach that operates on features from the shallow layers of a pretrained CNN, requires neither retraining nor exposure to OOD datasets, and is tailored to resource-constrained edge devices.

Findings

01. Outperforms existing OOD detection methods in accuracy

02. Reduces communication and computation costs in collaborative inference

03. Works effectively without retraining pretrained models
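
The cost saving in the second finding can be made concrete with a little arithmetic. The sketch below assumes the edge device drops every input it flags as OOD and uploads the rest. The function name `upload_fraction` and the example rates are hypothetical and are not taken from the paper.

```python
def upload_fraction(ood_rate, tpr, fpr):
    """Fraction of all inputs still uploaded to the server.

    ood_rate: share of inputs that are truly OOD (assumed workload mix)
    tpr:      share of OOD inputs the edge detector flags correctly
    fpr:      share of ID inputs the edge detector flags by mistake
    """
    kept_id = (1.0 - ood_rate) * (1.0 - fpr)  # ID inputs that pass the filter
    missed_ood = ood_rate * (1.0 - tpr)       # OOD inputs that slip through
    return kept_id + missed_ood


# Illustrative numbers only: 30% OOD traffic, 90% TPR, 5% FPR.
fraction = upload_fraction(ood_rate=0.3, tpr=0.9, fpr=0.05)
print(f"uploaded: {fraction:.3f}")
```

With these illustrative rates about 69.5% of inputs are still uploaded. The 5% of ID inputs that are falsely flagged are the price paid for the saving, since they never reach the server.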

Abstract

Collaborative inference enables resource-constrained edge devices to make inferences by uploading inputs (e.g., images) to a server (i.e., cloud) where the heavy deep learning models run. While this setup works cost-effectively for successful inferences, it severely underperforms when the model faces input samples on which it was not trained (known as Out-of-Distribution (OOD) samples). If the edge devices could at least detect that an input sample is OOD, they could save communication and computation resources by not uploading that input to the server for inference. In this paper, we propose a novel lightweight OOD detection approach that mines important features from the shallow layers of a pretrained CNN model and detects an input sample as ID (In-Distribution) or OOD based on a distance function defined on the reduced feature space. Our technique…
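
The general idea in the abstract (reduce the shallow-layer features, then threshold a distance) can be illustrated with a minimal sketch. It assumes the shallow-layer feature vectors have already been extracted, and it uses stand-ins throughout: variance-based feature selection, Euclidean distance to the ID centroid, and a threshold set at a quantile of the ID distances. None of these choices is taken from the paper, and the names `select_features`, `fit_detector`, and `is_ood` are hypothetical.

```python
import math
import random


def select_features(id_features, k):
    """Return indices of the k highest-variance features (assumed criterion)."""
    n = len(id_features)
    dim = len(id_features[0])
    means = [sum(row[j] for row in id_features) / n for j in range(dim)]
    variances = [
        sum((row[j] - means[j]) ** 2 for row in id_features) / n
        for j in range(dim)
    ]
    return sorted(range(dim), key=lambda j: variances[j], reverse=True)[:k]


def fit_detector(id_features, k=4, quantile=0.95):
    """Fit on ID features only: selected indices, centroid, distance threshold."""
    idx = select_features(id_features, k)
    reduced = [[row[j] for j in idx] for row in id_features]
    centroid = [sum(col) / len(reduced) for col in zip(*reduced)]
    dists = sorted(math.dist(r, centroid) for r in reduced)
    # Threshold: the chosen quantile of ID distances (assumed rule).
    threshold = dists[min(len(dists) - 1, int(quantile * len(dists)))]
    return idx, centroid, threshold


def is_ood(feature, detector):
    """Flag a sample as OOD if it lies farther from the centroid than the threshold."""
    idx, centroid, threshold = detector
    return math.dist([feature[j] for j in idx], centroid) > threshold


# Synthetic stand-in for shallow-layer features: ID samples cluster near 0.
random.seed(0)
id_train = [[random.gauss(0.0, 1.0) for _ in range(8)] for _ in range(200)]
detector = fit_detector(id_train)

id_sample = [0.0] * 8    # close to the ID cluster
ood_sample = [5.0] * 8   # far from the ID cluster
print(is_ood(id_sample, detector), is_ood(ood_sample, detector))
```

On an edge device, a check like `is_ood` would run before the upload step, and only inputs that pass it would be sent to the server. The detector is fitted on ID data alone, which matches the abstract's setting of not training on OOD samples.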
