Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary   Differential Equations

Yi Zhang; Chun-Wun Cheng; Junyi He; Zhihai He; Carola-Bibiane; Sch\"onlieb; Yuyan Chen; Angelica I Aviles-Rivero

arXiv:2412.15813·cs.CV·December 23, 2024

Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations

Yi Zhang, Chun-Wun Cheng, Junyi He, Zhihai He, Carola-Bibiane, Sch\"onlieb, Yuyan Chen, Angelica I Aviles-Rivero

PDF

Open Access 1 Video

TL;DR

SONO introduces a second-order neural ODE approach for cross-modal few-shot learning, improving generalization and reducing overfitting by leveraging text-based augmentation and efficient initialization, outperforming existing methods.

Contribution

The paper proposes SONO, a novel second-order neural ODE framework that enhances cross-modal few-shot learning with improved expressiveness and data augmentation techniques.

Findings

01

Outperforms state-of-the-art in few-shot learning tasks

02

Utilizes text-based augmentation with CLIP for data enrichment

03

Reduces overfitting through second-order neural ODEs

Abstract

We introduce SONO, a novel method leveraging Second-Order Neural Ordinary Differential Equations (Second-Order NODEs) to enhance cross-modal few-shot learning. By employing a simple yet effective architecture consisting of a Second-Order NODEs model paired with a cross-modal classifier, SONO addresses the significant challenge of overfitting, which is common in few-shot scenarios due to limited training examples. Our second-order approach can approximate a broader class of functions, enhancing the model's expressive power and feature generalization capabilities. We initialize our cross-modal classifier with text embeddings derived from class-relevant prompts, streamlining training efficiency by avoiding the need for frequent text encoder processing. Additionally, we utilize text-based image augmentation, exploiting CLIP's robust image-text correlation to enrich training data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations· underline

Taxonomy

TopicsSeismic Imaging and Inversion Techniques