EARLIN: Early Out-of-Distribution Detection for Resource-efficient Collaborative Inference
Sumaiya Tabassum Nimi, Md Adnan Arefeen, Md Yusuf Sarwar Uddin,, Yugyung Lee

TL;DR
EARLIN is a lightweight, pretrained-model-based OOD detection method that improves resource efficiency in collaborative inference by identifying out-of-distribution inputs early, reducing unnecessary cloud communication.
Contribution
It introduces a novel OOD detection approach that operates on shallow CNN features without retraining or exposing OOD datasets, tailored for resource-constrained edge devices.
Findings
Outperforms existing OOD detection methods in accuracy
Reduces communication and computation costs in collaborative inference
Works effectively without retraining pretrained models
Abstract
Collaborative inference enables resource-constrained edge devices to make inferences by uploading inputs (e.g., images) to a server (i.e., cloud) where the heavy deep learning models run. While this setup works cost-effectively for successful inferences, it severely underperforms when the model faces input samples on which the model was not trained (known as Out-of-Distribution (OOD) samples). If the edge devices could, at least, detect that an input sample is an OOD, that could potentially save communication and computation resources by not uploading those inputs to the server for inference workload. In this paper, we propose a novel lightweight OOD detection approach that mines important features from the shallow layers of a pretrained CNN model and detects an input sample as ID (In-Distribution) or OOD based on a distance function defined on the reduced feature space. Our technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
