Cabbage Sweeter than Cake? Analysing the Potential of Large Language   Models for Learning Conceptual Spaces

Usashi Chatterjee; Amit Gajbhiye; Steven Schockaert

arXiv:2310.05481·cs.CL·October 10, 2023

Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces

Usashi Chatterjee, Amit Gajbhiye, Steven Schockaert

PDF

Open Access

TL;DR

This paper investigates the capacity of large language models to learn conceptual spaces, finding that fine-tuned BERT models can perform comparably or better than larger GPT-3 models in representing perceptual features.

Contribution

It demonstrates that smaller, fine-tuned BERT models can effectively learn conceptual spaces, challenging the assumption that larger models are always superior for this task.

Findings

01

Fine-tuned BERT models match or outperform GPT-3 in learning conceptual spaces.

02

LLMs can learn meaningful perceptual representations to some extent.

03

Smaller models can be more effective than larger ones when fine-tuned appropriately.

Abstract

The theory of Conceptual Spaces is an influential cognitive-linguistic framework for representing the meaning of concepts. Conceptual spaces are constructed from a set of quality dimensions, which essentially correspond to primitive perceptual features (e.g. hue or size). These quality dimensions are usually learned from human judgements, which means that applications of conceptual spaces tend to be limited to narrow domains (e.g. modelling colour or taste). Encouraged by recent findings about the ability of Large Language Models (LLMs) to learn perceptually grounded representations, we explore the potential of such models for learning conceptual spaces. Our experiments show that LLMs can indeed be used for learning meaningful representations to some extent. However, we also find that fine-tuned models of the BERT family are able to match or even outperform the largest GPT-3 model,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Dropout · Weight Decay · {Dispute@FaQ-s}How to file a dispute with Expedia? · Softmax · Byte Pair Encoding · Linear Warmup With Cosine Annealing