Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
Yang Shu, Zhangjie Cao, Jinghan Gao, Jianmin Wang, Philip S. Yu,, Mingsheng Long

TL;DR
Omni-Training introduces a unified framework combining pre-training and meta-training with a tri-flow architecture and Omni-Loss, significantly enhancing data efficiency and transferability in few-shot learning across diverse tasks and domains.
Contribution
The paper proposes a novel Omni-Training framework with a tri-flow Omni-Net architecture and Omni-Loss, effectively bridging pre-training and meta-training for improved few-shot learning.
Findings
Outperforms state-of-the-art methods in cross-task and cross-domain settings
Enhances transferability in classification, regression, and reinforcement learning
Demonstrates consistent improvements across multiple benchmarks
Abstract
Few-shot learning aims to fast adapt a deep model from a few examples. While pre-training and meta-training can create deep models powerful for few-shot generalization, we find that pre-training and meta-training focuses respectively on cross-domain transferability and cross-task transferability, which restricts their data efficiency in the entangled settings of domain shift and task shift. We thus propose the Omni-Training framework to seamlessly bridge pre-training and meta-training for data-efficient few-shot learning. Our first contribution is a tri-flow Omni-Net architecture. Besides the joint representation flow, Omni-Net introduces two parallel flows for pre-training and meta-training, responsible for improving domain transferability and task transferability respectively. Omni-Net further coordinates the parallel flows by routing their representations via the joint-flow, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
