SandboxAQ's submission to MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval
Isidora Chara Tourni, Sayontan Ghosh, Brenda Miao, Constantijn van der, Poel

TL;DR
This paper evaluates the performance of five large language models on multilingual question answering and named entity recognition, highlighting the variability across languages and tasks, and emphasizing the importance of task-specific approaches.
Contribution
It provides a comprehensive comparison of prompting techniques across multiple languages and tasks, revealing the nuanced effectiveness of models and prompting strategies in multilingual NLP.
Findings
Advanced prompting improves QA performance
Model effectiveness varies by language and task
Language difficulty impacts model performance
Abstract
This paper explores the problems of Question Answering (QA) and Named Entity Recognition (NER) in five diverse languages. We tested five Large Language Models with various prompting methods, including zero-shot, chain-of-thought reasoning, and translation techniques. Our results show that while some models consistently outperform others, their effectiveness varies significantly across tasks and languages. We saw that advanced prompting techniques generally improved QA performance but had mixed results for NER; and we observed that language difficulty patterns differed between tasks. Our findings highlight the need for task-specific approaches in multilingual NLP and suggest that current models may develop different linguistic competencies for different tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Absolute Position Encodings · Multi-Head Attention
