Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning

Yang Shu; Zhangjie Cao; Jinghan Gao; Jianmin Wang; Philip S. Yu; Mingsheng Long

arXiv:2110.07510 · cs.LG · December 20, 2022

Open Access

TL;DR

Omni-Training introduces a unified framework combining pre-training and meta-training with a tri-flow architecture and Omni-Loss, significantly enhancing data efficiency and transferability in few-shot learning across diverse tasks and domains.

Contribution

The paper proposes a novel Omni-Training framework with a tri-flow Omni-Net architecture and Omni-Loss, effectively bridging pre-training and meta-training for improved few-shot learning.

Findings

01 Outperforms state-of-the-art methods in cross-task and cross-domain settings

02 Enhances transferability in classification, regression, and reinforcement learning

03 Demonstrates consistent improvements across multiple benchmarks

Abstract

Few-shot learning aims to quickly adapt a deep model from a few examples. While pre-training and meta-training can each create deep models powerful for few-shot generalization, we find that pre-training and meta-training focus respectively on cross-domain transferability and cross-task transferability, which restricts their data efficiency in the entangled settings of domain shift and task shift. We thus propose the Omni-Training framework to seamlessly bridge pre-training and meta-training for data-efficient few-shot learning. Our first contribution is a tri-flow Omni-Net architecture. Besides the joint representation flow, Omni-Net introduces two parallel flows for pre-training and meta-training, responsible for improving domain transferability and task transferability respectively. Omni-Net further coordinates the parallel flows by routing their representations via the joint flow, enabling…
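The tri-flow idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the class name `TriFlowLayer`, the helper `tri_flow_forward`, the use of plain linear maps, and the additive way each parallel flow is combined with the shared joint transform are all assumptions made for illustration. The sketch only shows the general shape of a layer that carries a joint flow plus a pre-training flow and a meta-training flow, with the parallel flows passing through the shared joint weights.

```python
import random
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def matvec(w: Matrix, x: Sequence[float]) -> Vector:
    """Multiply matrix w (out_dim x in_dim) by vector x."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]


def add(a: Sequence[float], b: Sequence[float]) -> Vector:
    """Element-wise sum of two equal-length vectors."""
    return [u + v for u, v in zip(a, b)]


def relu(x: Sequence[float]) -> Vector:
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in x]


def init_matrix(rng: random.Random, out_dim: int, in_dim: int) -> Matrix:
    """Small random weights; a stand-in for learned parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]


class TriFlowLayer:
    """Hypothetical layer with one shared (joint) map and two flow-specific maps."""

    def __init__(self, in_dim: int, out_dim: int, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.w_joint = init_matrix(rng, out_dim, in_dim)  # shared by all flows
        self.w_pre = init_matrix(rng, out_dim, in_dim)    # pre-training flow only
        self.w_meta = init_matrix(rng, out_dim, in_dim)   # meta-training flow only

    def forward(
        self, x_joint: Vector, x_pre: Vector, x_meta: Vector
    ) -> Tuple[Vector, Vector, Vector]:
        # Joint flow: shared weights only.
        h_joint = matvec(self.w_joint, x_joint)
        # Assumption: each parallel flow is routed through the shared weights
        # and adds its own flow-specific transform on top.
        h_pre = add(matvec(self.w_joint, x_pre), matvec(self.w_pre, x_pre))
        h_meta = add(matvec(self.w_joint, x_meta), matvec(self.w_meta, x_meta))
        return relu(h_joint), relu(h_pre), relu(h_meta)


def tri_flow_forward(
    layers: Sequence[TriFlowLayer], x: Vector
) -> Tuple[Vector, Vector, Vector]:
    """Feed the same input into all three flows and stack the layers."""
    flows = (list(x), list(x), list(x))
    for layer in layers:
        flows = layer.forward(*flows)
    return flows


if __name__ == "__main__":
    net = [TriFlowLayer(4, 3, seed=1), TriFlowLayer(3, 2, seed=2)]
    joint, pre, meta = tri_flow_forward(net, [1.0, -2.0, 0.5, 3.0])
    print("joint:", joint)
    print("pre:  ", pre)
    print("meta: ", meta)
```

In a setup like this, one would presumably attach a standard classification objective to the pre-training flow's output and an episodic few-shot objective to the meta-training flow's output, so that the shared weights receive gradients from both; how the paper actually does this is not specified in the text above.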

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI