On the Usability of Transformers-based models for a French Question-Answering task

Oralie Cattan; Christophe Servan; Sophie Rosset

arXiv:2207.09150·cs.CL·July 20, 2022

TL;DR

This paper evaluates how usable Transformer-based models are for French question answering, with a focus on resource efficiency and data scarcity, and proposes a new compact model for low-resource French NLP applications.

Contribution

The paper assesses the performance of Transformer models on French QA and introduces FrALBERT, a new compact French model aimed at low-resource scenarios.

Findings

01. Data augmentation improves French QA performance.
02. Hyperparameter tuning enhances model stability.
03. The new FrALBERT model is competitive in low-resource settings.

Abstract

For many tasks, state-of-the-art results have been achieved with Transformer-based architectures, resulting in a paradigm shift in practice from task-specific architectures to the fine-tuning of pre-trained language models. The ongoing trend is to train models with ever-increasing amounts of data and parameters, which requires considerable resources. This has prompted a strong effort to improve resource efficiency through algorithmic and hardware improvements, which have been evaluated only for English. This raises questions about their usability when applied to small-scale learning problems, for which a limited amount of training data is available, especially for tasks in under-resourced languages. The lack of appropriately sized corpora is a hindrance to applying data-driven and transfer learning-based approaches, with cases of strong instability. In this paper, we establish a…

Code & Models

Models

cservan/french-albert-base-cased (Hugging Face model)
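
The listed checkpoint can presumably be loaded like any other model on the Hugging Face Hub. The sketch below is a minimal illustration, not the authors' code: it assumes the `transformers`, `torch`, and `sentencepiece` packages are installed and that the repository works with the generic `AutoTokenizer` and `AutoModel` loaders. The helper names `load_french_albert` and `encode_question` are hypothetical. Only the repository id comes from the listing above.

```python
from typing import Any, Tuple

# Repository id exactly as listed above; everything else is an assumption.
MODEL_ID = "cservan/french-albert-base-cased"


def load_french_albert(model_id: str = MODEL_ID) -> Tuple[Any, Any]:
    """Download the tokenizer and encoder weights from the Hugging Face Hub.

    Assumes `transformers`, `torch` and `sentencepiece` are installed and
    that the checkpoint is compatible with the generic Auto* loaders.
    """
    # Imported lazily so that importing this module needs no heavy dependencies.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def encode_question(question: str, context: str) -> Any:
    """Encode a question/context pair and return the last hidden states.

    Hypothetical helper: it only produces contextual embeddings and does
    not predict an answer span.
    """
    tokenizer, model = load_french_albert()
    # Passing two strings builds a single sequence-pair input.
    inputs = tokenizer(question, context, return_tensors="pt", truncation=True)
    outputs = model(**inputs)
    return outputs.last_hidden_state
```

Calling `encode_question("Où se trouve la tour Eiffel ?", "La tour Eiffel se trouve à Paris.")` would download the weights on first use and return one embedding per input token. Answering questions would additionally require a span-prediction head fine-tuned on a French QA dataset. Whether this checkpoint ships such a head is not stated in the listing.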