Designing and Contextualising Probes for African Languages
Wisdom Aduah, Francois Meyer

TL;DR
This study systematically investigates how pretrained language models encode linguistic features of African languages, revealing that language-specific models contain more relevant information than multilingual models and analyzing the distribution of linguistic knowledge across layers.
Contribution
First systematic probing of African language PLMs, analyzing linguistic feature distribution and interpreting internal mechanisms with control tasks and baselines.
Findings
African language PLMs encode more linguistic info than multilingual models.
Token-level syntax is in middle-to-last layers; semantics are distributed.
Probing performance reflects true internal knowledge, not memorization.
Abstract
Pretrained language models (PLMs) for African languages are continually improving, but the reasons behind these advances remain unclear. This paper presents the first systematic investigation into probing PLMs for linguistic knowledge about African languages. We train layer-wise probes for six typologically diverse African languages to analyse how linguistic features are distributed. We also design control tasks, a way to interpret probe performance, for the MasakhaPOS dataset. We find PLMs adapted for African languages to encode more linguistic information about target languages than massively multilingual PLMs. Our results reaffirm previous findings that token-level syntactic information concentrates in middle-to-last layers, while sentence-level semantic information is distributed across all layers. Through control tasks and probing baselines, we confirm that performance reflects the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultilingual Education and Policy · EFL/ESL Teaching and Learning · ICT in Developing Communities
