FewTopNER: Integrating Few-Shot Learning with Topic Modeling and Named Entity Recognition in a Multilingual Framework
Ibrahim Bouabdallaoui, Fatima Guerouate, Samya Bouhaddour, Chaimae, Saadi, Mohammed Sbihi

TL;DR
FewTopNER presents a novel multilingual framework that combines few-shot named entity recognition with topic modeling, significantly improving performance in low-resource and cross-lingual scenarios by integrating semantic context.
Contribution
It introduces a unified architecture that fuses topic modeling with few-shot NER using a shared multilingual encoder and cross-task attention, advancing low-resource multilingual NER.
Findings
Achieves 2.5-4.0% higher F1 scores on multilingual benchmarks.
Enhances topic coherence measured by normalized pointwise mutual information.
Ablation confirms the importance of cross-task integration for performance.
Abstract
We introduce FewTopNER, a novel framework that integrates few-shot named entity recognition (NER) with topic-aware contextual modeling to address the challenges of cross-lingual and low-resource scenarios. FewTopNER leverages a shared multilingual encoder based on XLM-RoBERTa, augmented with language-specific calibration mechanisms, to generate robust contextual embeddings. The architecture comprises a prototype-based entity recognition branch, employing BiLSTM and Conditional Random Fields for sequence labeling, and a topic modeling branch that extracts document-level semantic features through hybrid probabilistic and neural methods. A cross-task bridge facilitates dynamic bidirectional attention and feature fusion between entity and topic representations, thereby enhancing entity disambiguation by incorporating global semantic context. Empirical evaluations on multilingual benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsSoftmax · Attention Is All You Need · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Bidirectional LSTM
