Learning Abstract Task Representations
Mikhail M. Meskhi, Adriano Rivolli, Rafael G. Mantovani, Ricardo, Vilalta

TL;DR
This paper introduces a deep learning approach to generate high-level abstract meta-features for tasks, improving model performance prediction and feature relevance over traditional meta-features.
Contribution
It proposes a novel method to learn abstract meta-features as latent variables within a neural network, addressing limitations of traditional meta-features.
Findings
Meta-models with induced meta-features outperform others by ~18%
Abstract meta-features achieve high feature-relevance scores
Method demonstrates improved generalization performance prediction
Abstract
A proper form of data characterization can guide the process of learning-algorithm selection and model-performance estimation. The field of meta-learning has provided a rich body of work describing effective forms of data characterization using different families of meta-features (statistical, model-based, information-theoretic, topological, etc.). In this paper, we start with the abundant set of existing meta-features and propose a method to induce new abstract meta-features as latent variables in a deep neural network. We discuss the pitfalls of using traditional meta-features directly and argue for the importance of learning high-level task properties. We demonstrate our methodology using a deep neural network as a feature extractor. We demonstrate that 1) induced meta-models mapping abstract meta-features to generalization performance outperform other methods by ~18% on average, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
