MLink: Linking Black-Box Models from Multiple Domains for Collaborative Inference
Mu Yuan, Lan Zhang, Zimu Zheng, Yi-Nan Zhang, Xiang-Yang Li

TL;DR
MLink introduces a novel approach to connect diverse black-box ML models through learned mappings, enabling cost-efficient multi-model inference with high accuracy in resource-constrained environments.
Contribution
The paper proposes a new model linking technique for heterogeneous black-box models and a scheduling algorithm, MLink, to improve inference efficiency under budget constraints.
Findings
MLink reduces inference computations by 66.7%
Achieves 94% inference accuracy under GPU memory constraints
Outperforms existing baselines in cost-efficiency and accuracy
Abstract
The cost efficiency of model inference is critical to real-world machine learning (ML) applications, especially for delay-sensitive tasks and resource-limited devices. A typical dilemma is: in order to provide complex intelligent services (e.g. smart city), we need inference results of multiple ML models, but the cost budget (e.g. GPU memory) is not enough to run all of them. In this work, we study underlying relationships among black-box ML models and propose a novel learning task: model linking, which aims to bridge the knowledge of different black-box models by learning mappings (dubbed model links) between their output spaces. We propose the design of model links which supports linking heterogeneous black-box ML models. Also, in order to address the distribution discrepancy challenge, we present adaptation and aggregation methods of model links. Based on our proposed model links, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Age of Information Optimization · Domain Adaptation and Few-Shot Learning
