Quantum-Like Contextuality in Large Language Models
Kin Ian Lo, Mehrnoosh Sadrzadeh, Shane Mansfield

TL;DR
This study demonstrates large-scale evidence of quantum-like contextuality in natural language using BERT, suggesting quantum methods could enhance language processing tasks.
Contribution
First large-scale experimental demonstration of quantum-like contextuality in natural language, linking it to semantic similarity and embedding distances.
Findings
Identified over 77,000 sheaf-contextual instances
Discovered nearly 37 million CbD contextual instances
Established a relationship between contextuality and BERT embedding distances
Abstract
Contextuality is a distinguishing feature of quantum mechanics and there is growing evidence that it is a necessary condition for quantum advantage. In order to make use of it, researchers have been asking whether similar phenomena arise in other domains. The answer has been yes, e.g. in behavioural sciences. However, one has to move to frameworks that take some degree of signalling into account. Two such frameworks exist: (1) a signalling-corrected sheaf theoretic model, and (2) the Contextuality-by-Default (CbD) framework. This paper provides the first large scale experimental evidence for a yes answer in natural language. We construct a linguistic schema modelled over a contextual quantum scenario, instantiate it in the Simple English Wikipedia and extract probability distributions for the instances using the large language model BERT. This led to the discovery of 77,118…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · Dense Connections · Multi-Head Attention · Residual Connection · Adam · Layer Normalization · Weight Decay
