Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource   Commonsense Reasoning

Yu Fu; Jie He; Yifan Yang; Qun Liu; Deyi Xiong

arXiv:2409.19075·cs.CL·April 14, 2025

Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning

Yu Fu, Jie He, Yifan Yang, Qun Liu, Deyi Xiong

PDF

Open Access

TL;DR

Meta-RTL introduces a reinforcement learning framework that dynamically weights source tasks in meta-transfer learning, significantly improving low-resource commonsense reasoning performance using BERT and ALBERT.

Contribution

It proposes a novel reinforcement-based approach to estimate source task relevance, enhancing knowledge transfer in low-resource settings.

Findings

01

Meta-RTL outperforms existing methods on benchmark datasets.

02

The approach achieves larger gains in extremely low-resource scenarios.

03

Reinforcement-based task weighting improves transfer learning effectiveness.

Abstract

Meta learning has been widely used to exploit rich-resource source tasks to improve the performance of low-resource target tasks. Unfortunately, most existing meta learning approaches treat different source tasks equally, ignoring the relatedness of source tasks to the target task in knowledge transfer. To mitigate this issue, we propose a reinforcement-based multi-source meta-transfer learning framework (Meta-RTL) for low-resource commonsense reasoning. In this framework, we present a reinforcement-based approach to dynamically estimating source task weights that measure the contribution of the corresponding tasks to the target task in the meta-transfer learning. The differences between the general loss of the meta model and task-specific losses of source-specific temporal meta models on sampled target data are fed into the policy network of the reinforcement learning module as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Softmax · Attention Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Adam · WordPiece