Efficiently Learning Branching Networks for Multitask Algorithmic Reasoning
Dongyue Li, Zhenshuo Zhang, Minxuan Duan, Edgar Dobriban, Hongyang R. Zhang

TL;DR
This paper introduces AutoBRANE, an efficient algorithm for automatically learning optimal branching structures in neural networks to improve multitask algorithmic reasoning across graph and text benchmarks.
Contribution
We propose AutoBRANE, a convex relaxation-based method that efficiently searches for optimal task partitions in branching neural networks for multitask reasoning.
Findings
AutoBRANE estimates task performance within 5% error across multiple models.
It outperforms existing baselines on the CLRS benchmark with a 3.7% accuracy increase.
AutoBRANE reduces runtime by 48% and memory by 26% on complex benchmarks.
Abstract
Algorithmic reasoning -- the ability to perform step-by-step logical inference -- has become a core benchmark for evaluating reasoning in graph neural networks (GNNs) and large language models (LLMs). Ideally, one would like to design a single model capable of performing well on multiple algorithmic reasoning tasks simultaneously. However, this is challenging when the execution steps of algorithms differ from one another, causing negative interference when they are trained together. We propose branching neural networks, a principled architecture for multitask algorithmic reasoning. Searching for the optimal -ary tree with layers over algorithmic tasks is combinatorial, requiring exploration of up to possible structures. We develop AutoBRANE, an efficient algorithm that reduces this search to time by solving a convex relaxation at each layer to approximate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Topic Modeling
