Do Language Models Understand Measurements?

Sungjin Park; Seungwoo Ryu; Edward Choi

arXiv:2210.12694·cs.CL·October 25, 2022

Do Language Models Understand Measurements?

Sungjin Park, Seungwoo Ryu, Edward Choi

PDF

Open Access

TL;DR

This paper investigates whether pre-trained language models can understand measurements, finds they lack this ability, but can improve with a new embedding strategy and measurement-rich training data.

Contribution

It introduces a simple embedding method to enhance measurement understanding and demonstrates the importance of measurement-rich training data for PLMs.

Findings

01

PLMs lack inherent measurement reasoning capabilities

02

Training on measurement-rich corpora improves understanding

03

Embedding strategies significantly boost measurement comprehension

Abstract

Recent success of pre-trained language models (PLMs) has stimulated interest in their ability to understand and work with numbers. Yet, the numerical reasoning over measurements has not been formally studied despite their importance. In this study, we show that PLMs lack the capability required for reasoning over measurements. Furthermore, we find that a language model trained on a measurement-rich corpus shows better performance on understanding measurements. We propose a simple embedding strategy to better distinguish between numbers and units, which leads to a significant improvement in the probing tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational Physics and Python Applications