Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation
Casimiro Pio Carrino, Carlos Escolano, Jos\'e A. R. Fonollosa

TL;DR
This paper introduces a novel self-knowledge distillation method with mAP@k coefficients to improve generalized cross-lingual question answering, especially in resource-limited and zero-shot scenarios, outperforming traditional fine-tuning methods.
Contribution
It presents a new self-distillation training strategy with dynamic knowledge regulation, enhancing cross-lingual QA transfer in low-resource settings.
Findings
Outperforms standard cross-entropy fine-tuning in cross-lingual QA tasks.
Achieves competitive results with machine-translated data in resource-constrained scenarios.
Provides comprehensive analysis and ablation studies validating the approach.
Abstract
Despite substantial progress in multilingual extractive Question Answering (QA), models with high and uniformly distributed performance across languages remain challenging, especially for languages with limited resources. We study cross-lingual transfer mainly focusing on the Generalized Cross-Lingual Transfer (G-XLT) task, where the question language differs from the context language - a challenge that has received limited attention thus far. Our approach seeks to enhance cross-lingual QA transfer using a high-performing multilingual model trained on a large-scale dataset, complemented by a few thousand aligned QA examples across languages. Our proposed strategy combines cross-lingual sampling and advanced self-distillation training in generations to tackle the previous challenge. Notably, we introduce the novel mAP@k coefficients to fine-tune self-knowledge distillation loss,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications
