MLink: Linking Black-Box Models from Multiple Domains for Collaborative   Inference

Mu Yuan; Lan Zhang; Zimu Zheng; Yi-Nan Zhang; Xiang-Yang Li

arXiv:2209.13883·cs.AI·June 8, 2023

MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference

Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li

PDF

Open Access 3 Repos

TL;DR

MLink introduces a novel approach to connect diverse black-box ML models through learned mappings, enabling cost-efficient multi-model inference with high accuracy in resource-constrained environments.

Contribution

The paper proposes a new model linking technique for heterogeneous black-box models and a scheduling algorithm, MLink, to improve inference efficiency under budget constraints.

Findings

01

MLink reduces inference computations by 66.7%

02

Achieves 94% inference accuracy under GPU memory constraints

03

Outperforms existing baselines in cost-efficiency and accuracy

Abstract

The cost efficiency of model inference is critical to real-world machine learning (ML) applications, especially for delay-sensitive tasks and resource-limited devices. A typical dilemma is: in order to provide complex intelligent services (e.g. smart city), we need inference results of multiple ML models, but the cost budget (e.g. GPU memory) is not enough to run all of them. In this work, we study underlying relationships among black-box ML models and propose a novel learning task: model linking, which aims to bridge the knowledge of different black-box models by learning mappings (dubbed model links) between their output spaces. We propose the design of model links which supports linking heterogeneous black-box ML models. Also, in order to address the distribution discrepancy challenge, we present adaptation and aggregation methods of model links. Based on our proposed model links, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Age of Information Optimization · Domain Adaptation and Few-Shot Learning