Guided Evolution with Binary Discriminators for ML Program Search
John D. Co-Reyes, Yingjie Miao, George Tucker, Aleksandra Faust,, Esteban Real

TL;DR
This paper introduces a guided evolution approach using a binary discriminator to efficiently search for better machine learning programs, significantly speeding up the process across various tasks.
Contribution
It presents a novel method that employs an online-trained discriminator to guide evolutionary search in AutoML, improving speed and effectiveness.
Findings
Achieved 3.7x speedup in symbolic optimizer search
Achieved 4x speedup in RL loss function optimization
Demonstrated effectiveness across diverse ML components
Abstract
How to automatically design better machine learning programs is an open problem within AutoML. While evolution has been a popular tool to search for better ML programs, using learning itself to guide the search has been less successful and less understood on harder problems but has the promise to dramatically increase the speed and final performance of the optimization process. We propose guiding evolution with a binary discriminator, trained online to distinguish which program is better given a pair of programs. The discriminator selects better programs without having to perform a costly evaluation and thus speed up the convergence of evolution. Our method can encode a wide variety of ML components including symbolic optimizers, neural architectures, RL loss functions, and symbolic regression equations with the same directed acyclic graph representation. By combining this…
Peer Reviews
Decision·Submitted to ICLR 2024
The idea of guided mutation is interesting and well-inspired. It is great to see that this method uses a general approach that work with many different AutoML tasks, where the genotypes can be represented by DAGs. This general approach tries to predict the performance of a solution without evaluation, and thus filter out bad solutions in mutation to hopefully speed up evolution. While all the components are from existing work, the method itself is novel given to its genrality and application on
The main issue of the paper is that the author claims to improve the efficiency of evolution by reducing the sample size needed for evaluation, but completely ignores the computation cost of the proposed method itself. While we do see that the method seems to slightly reduce the sample size, it is unclear if there is really a speed-up in evolution. There are many factors that will affect the actual time, such the frequency of online training and the size of GNN, the average inference speed of ea
Evolving ML program in a large space
As summarized by AC from NeurIPS, the technical contributions are limited. In addition, The paper does not compare with other state-of-the-art methods for evolutionary AutoML, such as RL or generative models.
- Draft is very well written. Language is clear at all times, the structure and sequence make complete sense, and figures and plots are in general easy to understand. - The proposed concept is not only very innovative, but also very *sane*: rather than using a predictor to rank a range of generated offspring, just better use it to decide which would be the best among two options. It makes sense because it sounds to be less error-prone. And as their ablation study clarifies, it ultimately works b
Having talking about the strengths of the paper, now I have to say it that I actually love it and hate it at the same time, or maybe I hate to love it. We must understand that these kind of approaches are obfuscating an already quite obscure area: deep learning. DNN are essentially black boxes; efforts to disentangle their inner workings haven't yielded very tangible fruits yet. AutoML and NAS add another layer of obscurity. Even Genetic Programming, when used for serious task and not just toy
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research · Algorithms and Data Compression
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
