Probing for Multilingual Numerical Understanding in Transformer-Based   Language Models

Devin Johnson; Denise Mak; Drew Barker; Lexi Loessberg-Zahl

arXiv:2010.06666·cs.CL·October 15, 2020

Probing for Multilingual Numerical Understanding in Transformer-Based Language Models

Devin Johnson, Denise Mak, Drew Barker, Lexi Loessberg-Zahl

PDF

1 Repo

TL;DR

This paper investigates whether multilingual transformer models understand numerical compositionality in language by testing their ability to perform grammaticality and value comparison tasks across several languages.

Contribution

It introduces novel multilingual probing tasks to evaluate the models' understanding of numerical structures and compositional reasoning in natural language numbers.

Findings

01

Models encode enough information for grammaticality judgments.

02

Models generally lack sufficient information for accurate value comparisons.

03

Performance varies across languages and tasks.

Abstract

Natural language numbers are an example of compositional structures, where larger numbers are composed of operations on smaller numbers. Given that compositional reasoning is a key to natural language understanding, we propose novel multilingual probing tasks tested on DistilBERT, XLM, and BERT to investigate for evidence of compositional reasoning over numerical data in various natural language number systems. By using both grammaticality judgment and value comparison classification tasks in English, Japanese, Danish, and French, we find evidence that the information encoded in these pretrained models' embeddings is sufficient for grammaticality judgments but generally not for value comparisons. We analyze possible reasons for this and discuss how our tasks could be extended in further studies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dj1121/tlm_num_probe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Adam · Byte Pair Encoding · Softmax · Layer Normalization · Dense Connections · Multi-Head Attention · XLM · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia?