BandiK: Efficient Multi-Task Decomposition Using a Multi-Bandit Framework
Andr\'as Millinghoffer (1, 2), Andr\'as Formanek (1, 3), Andr\'as Antos (1), P\'eter Antal (1, 2) ((1) Department of Artificial Intelligence, Systems Engineering, Faculty of Electrical Engineering, Informatics, Budapest University of Technology, Economics

TL;DR
BandiK introduces a multi-bandit framework for efficient multi-task auxiliary set selection, significantly reducing computational costs and improving transfer learning across multiple tasks.
Contribution
It presents a novel three-stage multi-bandit method that estimates task transferability, constructs candidate auxiliary sets efficiently, and employs a multi-bandit structure for optimal task transfer.
Findings
Reduces auxiliary task set evaluation complexity
Improves transfer learning efficiency across tasks
Demonstrates effectiveness on multi-task learning benchmarks
Abstract
The challenge of effectively transferring knowledge across multiple tasks is of critical importance and is also present in downstream tasks with foundation models. However, the nature of transfer, its transitive-intransitive nature, is still an open problem, and negative transfer remains a significant obstacle. Selection of beneficial auxiliary task sets in multi-task learning is frequently hindered by the high computational cost of their evaluation, the high number of plausible candidate auxiliary sets, and the varying complexity of selection across target tasks. To address these constraints, we introduce BandiK, a novel three-stage multi-task auxiliary task subset selection method using multi-bandits, where each arm pull evaluates candidate auxiliary sets by training and testing a multiple output neural network on a single random train-test dataset split. Firstly, BandiK estimates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics
