Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning

Yang Shu; Zhangjie Cao; Jinghan Gao; Jianmin Wang; Philip S. Yu; Mingsheng Long

doi:10.1109/tpami.2023.3319517

JOURNAL ARTICLE

Year: 2023; Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence; Vol: 45 (12); Pages: 15275-15291; Publisher: IEEE Computer Society

Abstract

Few-shot learning aims to adapt a deep model quickly from a few examples. While pre-training and meta-training can both produce deep models that generalize well in few-shot settings, we find that pre-training focuses on cross-domain transferability and meta-training on cross-task transferability, which restricts their data efficiency in settings where domain shift and task shift are entangled. We therefore propose the Omni-Training framework, which bridges pre-training and meta-training for data-efficient few-shot learning. Our first contribution is Omni-Net, a tri-flow architecture. Besides the joint representation flow, Omni-Net introduces two parallel flows for pre-training and meta-training, responsible for improving domain transferability and task transferability respectively. Omni-Net further coordinates the parallel flows by routing their representations via the joint flow, enabling knowledge transfer across flows. Our second contribution is the Omni-Loss, which applies a self-distillation strategy separately to the pre-training and meta-training objectives to boost knowledge transfer across training stages. Omni-Training is a general framework that accommodates many existing algorithms. Evaluations show that our single framework consistently and clearly outperforms the individual state-of-the-art methods in both cross-task and cross-domain settings across a variety of classification, regression, and reinforcement learning problems.
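The abstract gives no layer-level detail, so the following is only an illustrative sketch of how a tri-flow layer and a self-distillation term might be wired together, not the authors' implementation. The names `TriFlowLayer`, `distillation_loss`, and `ema_update` are hypothetical. Three choices are assumptions rather than facts from the abstract: the routing rule (each parallel flow adds its own branch to the output of the shared joint weights), the KL-divergence form of the distillation term, and the exponential-moving-average teacher.

```python
import math
import random


def linear(weights, x):
    """Apply a weight matrix (a list of rows) to a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def init_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; the initialization scheme is an arbitrary choice."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


class TriFlowLayer:
    """Hypothetical layer: shared joint weights plus two flow-specific branches."""

    FLOWS = ("joint", "pre", "meta")

    def __init__(self, in_dim, out_dim, rng):
        self.weights = {name: init_matrix(out_dim, in_dim, rng) for name in self.FLOWS}

    def forward(self, x, flow):
        if flow not in self.FLOWS:
            raise ValueError(f"unknown flow: {flow!r}")
        # Every flow passes through the shared joint weights.
        out = linear(self.weights["joint"], x)
        if flow != "joint":
            # Assumed routing: add the flow-specific branch to the joint output.
            branch = linear(self.weights[flow], x)
            out = [a + b for a, b in zip(out, branch)]
        return [max(0.0, v) for v in out]  # ReLU nonlinearity


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(v - m) for v in logits))
    return [v - log_z for v in logits]


def distillation_loss(teacher_logits, student_logits):
    """KL(teacher || student) between the two predicted class distributions."""
    log_p = log_softmax(teacher_logits)
    log_q = log_softmax(student_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))


def ema_update(teacher, student, decay=0.99):
    """Move teacher weights toward the student's (an assumed teacher construction)."""
    return [[decay * t + (1.0 - decay) * s for t, s in zip(t_row, s_row)]
            for t_row, s_row in zip(teacher, student)]


# Usage: the same input routed through the pre-training and meta-training flows.
rng = random.Random(0)
layer = TriFlowLayer(in_dim=4, out_dim=3, rng=rng)
x = [0.5, -1.0, 2.0, 0.0]
pre_features = layer.forward(x, "pre")
meta_features = layer.forward(x, "meta")
```

In this sketch the joint weights are used by every flow, so they would receive gradients from both objectives, while each branch would be updated only by its own objective. Under the stated assumptions, the distillation term would be added separately to the pre-training loss and to the meta-training loss, each with a teacher derived from the model's own earlier state.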

Keywords:

Computer science; Artificial intelligence; Machine learning; Transfer of learning; Training (meteorology); Bridging (networking); Task analysis; Deep learning; Task (project management)

Metrics

Cited By

FWCI (Field Weighted Citation Impact): 2.04

Refs: 152

Citation Normalized Percentile: 0.86

Is in top 1%

Is in top 10%

Topics

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Meta-training with Demonstration Retrieval for Efficient Few-shot Learning

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems

Few-Shot Learning on Graphs: From Meta-Learning to LLM-empowered Pre-Training and Beyond

Regularized Meta-Training with Embedding Mixup for Improved Few-Shot Learning

StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning