Device Tuning for Multi-Task Large Model

Penghao Jiang; Xuanchen Hou; Yinsi Zhou

arXiv:2302.10820·cs.CV·February 22, 2023

Device Tuning for Multi-Task Large Model

Penghao Jiang, Xuanchen Hou, Yinsi Zhou

PDF

Open Access

TL;DR

This paper introduces Device Tuning, a framework that enhances multi-task learning efficiency across cloud and device by reducing communication costs through representation compression, enabling better generalization across tasks.

Contribution

The paper proposes a novel Device Tuning architecture that improves multi-task learning efficiency and generalization by optimizing communication between cloud and device.

Findings

01

Effective reduction in communication overhead.

02

Improved multi-task learning performance.

03

Demonstrated generalization across various tasks.

Abstract

Unsupervised pre-training approaches have achieved great success in many fields such as Computer Vision (CV), Natural Language Processing (NLP) and so on. However, compared to typical deep learning models, pre-training or even fine-tuning the state-of-the-art self-attention models is extremely expensive, as they require much more computational and memory resources. It severely limits their applications and success in a variety of domains, especially for multi-task learning. To improve the efficiency, we propose Device Tuning for the efficient multi-task model, which is a massively multitask framework across the cloud and device and is designed to encourage learning of representations that generalize better to many different tasks. Specifically, we design Device Tuning architecture of a multi-task model that benefits both cloud modelling and device modelling, which reduces the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Machine Learning and ELM