Betty: An Automatic Differentiation Library for Multilevel Optimization
Sang Keun Choe, Willie Neiswanger, Pengtao Xie, Eric Xing

TL;DR
Betty is a new software library that simplifies and accelerates multilevel optimization tasks in machine learning, reducing computational costs and memory usage while supporting large-scale models.
Contribution
Introducing Betty, a novel automatic differentiation library for large-scale multilevel optimization that improves efficiency, scalability, and flexibility compared to existing methods.
Findings
Reduced computational complexity from O(d^3) to O(d^2)
Achieved up to 11% increase in test accuracy
Decreased GPU memory usage by 14% and training time by 20%
Abstract
Gradient-based multilevel optimization (MLO) has gained attention as a framework for studying numerous problems, ranging from hyperparameter optimization and meta-learning to neural architecture search and reinforcement learning. However, gradients in MLO, which are obtained by composing best-response Jacobians via the chain rule, are notoriously difficult to implement and memory/compute intensive. We take an initial step towards closing this gap by introducing Betty, a software library for large-scale MLO. At its core, we devise a novel dataflow graph for MLO, which allows us to (1) develop efficient automatic differentiation for MLO that reduces the computational complexity from O(d^3) to O(d^2), (2) incorporate systems support such as mixed-precision and data-parallel training for scalability, and (3) facilitate implementation of MLO programs of arbitrary complexity while allowing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Neural Networks and Reservoir Computing · Advanced Bandit Algorithms Research
MethodsLib · Test
