Investigating the Impact of Language-Adaptive Fine-Tuning on Sentiment Analysis in Hausa Language Using AfriBERTa
Sani Abdullahi Sani, Shamsuddeen Hassan Muhammad, Devon Jarvis

TL;DR
This study explores how Language-Adaptive Fine-Tuning improves sentiment analysis in Hausa by adapting AfriBERTa with a curated corpus, showing modest gains and emphasizing the importance of diverse data for low-resource languages.
Contribution
It demonstrates the effectiveness of LAFT on Hausa sentiment analysis and highlights the significance of pre-trained models and diverse datasets for low-resource NLP.
Findings
LAFT provides modest performance improvements.
Pre-trained AfriBERTa outperforms non-Hausa models.
Formal Hausa text limits the effectiveness of LAFT.
Abstract
Sentiment analysis (SA) plays a vital role in Natural Language Processing (NLP) by ~identifying sentiments expressed in text. Although significant advances have been made in SA for widely spoken languages, low-resource languages such as Hausa face unique challenges, primarily due to a lack of digital resources. This study investigates the effectiveness of Language-Adaptive Fine-Tuning (LAFT) to improve SA performance in Hausa. We first curate a diverse, unlabeled corpus to expand the model's linguistic capabilities, followed by applying LAFT to adapt AfriBERTa specifically to the nuances of the Hausa language. The adapted model is then fine-tuned on the labeled NaijaSenti sentiment dataset to evaluate its performance. Our findings demonstrate that LAFT gives modest improvements, which may be attributed to the use of formal Hausa text rather than informal social media data. Nevertheless,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Digital Communication and Language · Authorship Attribution and Profiling
