BandiK: Efficient Multi-Task Decomposition Using a Multi-Bandit Framework

Andr\'as Millinghoffer (1; 2); Andr\'as Formanek (1; 3); Andr\'as Antos (1); P\'eter Antal (1; 2) ((1) Department of Artificial Intelligence; Systems Engineering; Faculty of Electrical Engineering; Informatics; Budapest University of Technology; Economics; (2) E-Group ICT Software Zrt.; Budapest; Hungary; (3) Department of Electrical Engineering (ESAT); STADIUS Center for Dynamical Systems; Signal Processing; Data Analytics; KU Leuven)

arXiv:2512.24708·cs.LG·January 1, 2026

BandiK: Efficient Multi-Task Decomposition Using a Multi-Bandit Framework

Andr\'as Millinghoffer (1, 2), Andr\'as Formanek (1, 3), Andr\'as Antos (1), P\'eter Antal (1, 2) ((1) Department of Artificial Intelligence, Systems Engineering, Faculty of Electrical Engineering, Informatics, Budapest University of Technology, Economics

PDF

Open Access

TL;DR

BandiK introduces a multi-bandit framework for efficient multi-task auxiliary set selection, significantly reducing computational costs and improving transfer learning across multiple tasks.

Contribution

It presents a novel three-stage multi-bandit method that estimates task transferability, constructs candidate auxiliary sets efficiently, and employs a multi-bandit structure for optimal task transfer.

Findings

01

Reduces auxiliary task set evaluation complexity

02

Improves transfer learning efficiency across tasks

03

Demonstrates effectiveness on multi-task learning benchmarks

Abstract

The challenge of effectively transferring knowledge across multiple tasks is of critical importance and is also present in downstream tasks with foundation models. However, the nature of transfer, its transitive-intransitive nature, is still an open problem, and negative transfer remains a significant obstacle. Selection of beneficial auxiliary task sets in multi-task learning is frequently hindered by the high computational cost of their evaluation, the high number of plausible candidate auxiliary sets, and the varying complexity of selection across target tasks. To address these constraints, we introduce BandiK, a novel three-stage multi-task auxiliary task subset selection method using multi-bandits, where each arm pull evaluates candidate auxiliary sets by training and testing a multiple output neural network on a single random train-test dataset split. Firstly, BandiK estimates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics