Distribution-Based Invariant Deep Networks for Learning Meta-Features

Gwendoline De Bie; Herilalaina Rakotoarison; Gabriel Peyr\'e,; Mich\`ele Sebag

arXiv:2006.13708·stat.ML·October 20, 2020

Distribution-Based Invariant Deep Networks for Learning Meta-Features

Gwendoline De Bie, Herilalaina Rakotoarison, Gabriel Peyr\'e,, Mich\`ele Sebag

PDF

Open Access

TL;DR

This paper introduces Dida, a neural architecture that achieves permutation invariance over features for learning dataset meta-features, demonstrating superior performance on dataset-level tasks compared to existing methods.

Contribution

The paper extends distribution-based neural networks to be invariant under feature permutation and empirically validates its effectiveness on dataset characterization tasks.

Findings

01

Dida outperforms state-of-the-art methods on dataset matching tasks.

02

Dida effectively characterizes datasets for hyper-parameter performance prediction.

03

The architecture is robust to Lipschitz-bounded transformations.

Abstract

Recent advances in deep learning from probability distributions successfully achieve classification or regression from distribution samples, thus invariant under permutation of the samples. The first contribution of the paper is to extend these neural architectures to achieve invariance under permutation of the features, too. The proposed architecture, called Dida, inherits the NN properties of universal approximation, and its robustness w.r.t. Lipschitz-bounded transformations of the input distribution is established. The second contribution is to empirically and comparatively demonstrate the merits of the approach on two tasks defined at the dataset level. On both tasks, Dida learns meta-features supporting the characterization of a (labelled) dataset. The first task consists of predicting whether two dataset patches are extracted from the same initial dataset. The second task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference

Methodsk-Nearest Neighbors · Logistic Regression · Support Vector Machine