Privileged Zero-Shot AutoML
Nikhil Singh, Brandon Kates, Jeff Mentch, Anant Kharkar, Madeleine, Udell, Iddo Drori

TL;DR
This paper introduces a zero-shot AutoML method that leverages textual descriptions of datasets and algorithms, significantly reducing computation time from minutes to milliseconds while maintaining high performance.
Contribution
It is the first to utilize privileged textual information with a Transformer model for AutoML, enabling rapid, zero-shot pipeline prediction across datasets.
Findings
Zero-shot AutoML achieves comparable performance to traditional methods.
Using textual descriptions improves classification accuracy.
Prediction times are reduced from minutes to milliseconds.
Abstract
This work improves the quality of automated machine learning (AutoML) systems by using dataset and function descriptions while significantly decreasing computation time from minutes to milliseconds by using a zero-shot approach. Given a new dataset and a well-defined machine learning task, humans begin by reading a description of the dataset and documentation for the algorithms to be used. This work is the first to use these textual descriptions, which we call privileged information, for AutoML. We use a pre-trained Transformer model to process the privileged text and demonstrate that using this information improves AutoML performance. Thus, our approach leverages the progress of unsupervised representation learning in natural language processing to provide a significant boost to AutoML. We demonstrate that using only textual descriptions of the data and functions achieves reasonable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Graph Neural Network · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Dropout · Layer Normalization · Adam
