Do ever larger octopi still amplify reporting biases? Evidence from   judgments of typical colour

Fangyu Liu; Julian Martin Eisenschlos; Jeremy R. Cole; Nigel Collier

arXiv:2209.12786·cs.CL·September 27, 2022·1 cites

Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour

Fangyu Liu, Julian Martin Eisenschlos, Jeremy R. Cole, Nigel Collier

PDF

Open Access

TL;DR

This study investigates whether larger language models still amplify reporting biases, specifically in judging the typical colour of objects, and finds that larger models better align with human judgments, counter to prior assumptions.

Contribution

The paper demonstrates that scaling up language models reduces reporting bias in perceptual tasks, showing larger models can better reflect human-like physical common sense.

Findings

01

Larger LLMs outperform smaller models in typical colour judgments.

02

Larger models more closely match human judgments.

03

Scaling models mitigates certain reporting biases.

Abstract

Language models (LMs) trained on raw texts have no direct access to the physical world. Gordon and Van Durme (2013) point out that LMs can thus suffer from reporting bias: texts rarely report on common facts, instead focusing on the unusual aspects of a situation. If LMs are only trained on text corpora and naively memorise local co-occurrence statistics, they thus naturally would learn a biased view of the physical world. While prior studies have repeatedly verified that LMs of smaller scales (e.g., RoBERTa, GPT-2) amplify reporting bias, it remains unknown whether such trends continue when models are scaled up. We investigate reporting bias from the perspective of colour in larger language models (LLMs) such as PaLM and GPT-3. Specifically, we query LLMs for the typical colour of objects, which is one simple type of perceptually grounded physical common sense. Surprisingly, we find…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Pathways Language Model · Linear Layer · Cosine Annealing · WordPiece · Layer Normalization · Linear Warmup With Linear Decay