Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits
Xiangheng He, Junjie Chen, Bj\"orn W. Schuller

TL;DR
This paper introduces a task selection method for multi-task dialogue act classification using non-stationary multi-armed bandits, improving performance by dynamically identifying useful tasks during training.
Contribution
It proposes a novel task assignment strategy based on non-stationary multi-armed bandits with discounted Thompson Sampling, tailored for multi-modal multi-task dialogue classification.
Findings
Significantly improves UAR and F1 scores over baselines.
Effectively identifies and avoids harmful or useless tasks during training.
Provides higher stability and performance on imbalanced datasets.
Abstract
Multi-task learning (MTL) aims to improve the performance of a primary task by jointly learning with related auxiliary tasks. Traditional MTL methods select tasks randomly during training. However, both previous studies and our results suggest that such a random selection of tasks may not be helpful, and can even be harmful to performance. Therefore, new strategies for task selection and assignment in MTL need to be explored. This paper studies the multi-modal, multi-task dialogue act classification task, and proposes a method for selecting and assigning tasks based on non-stationary multi-armed bandits (MAB) with discounted Thompson Sampling (TS) using Gaussian priors. Our experimental results show that in different training stages, different tasks have different utility. Our proposed method can effectively identify the task utility, actively avoid useless or harmful tasks, and realise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Advanced Bandit Algorithms Research · Context-Aware Activity Recognition Systems
