LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content
Mohamed Bayan Kmainasi, Ali Ezzat Shahroor, Maram Hasanain, Sahinur Rahman Laskar, Naeemul Hassan, Firoj Alam

TL;DR
LlamaLens is a multilingual LLM specialized for analyzing news and social media content. It outperforms the state of the art on 23 datasets across Arabic, English, and Hindi, addressing a gap in domain-specific multilingual NLP.
Contribution
This paper introduces LlamaLens, the first specialized multilingual LLM for news and social media analysis, focusing on domain-specificity and multilingual capabilities, with extensive evaluation across 18 tasks and 52 datasets.
Findings
LlamaLens outperforms SOTA on 23 datasets
Achieves comparable performance on 8 datasets
Models and resources are publicly available
Abstract
Large Language Models (LLMs) have demonstrated remarkable success as general-purpose task solvers across various fields. However, their capabilities remain limited when addressing domain-specific problems, particularly in downstream NLP tasks. Research has shown that models fine-tuned on instruction-based downstream NLP datasets outperform those that are not fine-tuned. While most efforts in this area have primarily focused on resource-rich languages like English and broad domains, little attention has been given to multilingual settings and specific domains. To address this gap, this study focuses on developing a specialized LLM, LlamaLens, for analyzing news and social media content in a multilingual context. To the best of our knowledge, this is the first attempt to tackle both domain specificity and multilinguality, with a particular focus on news and social media. Our experimental…
Taxonomy
Topics: Text and Document Classification Technologies · Web Data Mining and Analysis · Natural Language Processing Techniques
Methods: Softmax · Attention Is All You Need · Focus
