Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
Yu Fu, Jie He, Yifan Yang, Qun Liu, Deyi Xiong

TL;DR
Meta-RTL introduces a reinforcement learning framework that dynamically weights source tasks in meta-transfer learning, significantly improving low-resource commonsense reasoning performance using BERT and ALBERT.
Contribution
It proposes a novel reinforcement-based approach to estimate source task relevance, enhancing knowledge transfer in low-resource settings.
Findings
Meta-RTL outperforms existing methods on benchmark datasets.
The approach achieves larger gains in extremely low-resource scenarios.
Reinforcement-based task weighting improves transfer learning effectiveness.
Abstract
Meta learning has been widely used to exploit rich-resource source tasks to improve the performance of low-resource target tasks. Unfortunately, most existing meta learning approaches treat different source tasks equally, ignoring the relatedness of source tasks to the target task in knowledge transfer. To mitigate this issue, we propose a reinforcement-based multi-source meta-transfer learning framework (Meta-RTL) for low-resource commonsense reasoning. In this framework, we present a reinforcement-based approach to dynamically estimating source task weights that measure the contribution of the corresponding tasks to the target task in the meta-transfer learning. The differences between the general loss of the meta model and task-specific losses of source-specific temporal meta models on sampled target data are fed into the policy network of the reinforcement learning module as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Softmax · Attention Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Adam · WordPiece
