The Mathematics of Text Structure
Bob Coecke

TL;DR
This paper extends the DisCoCat framework to DisCoCirc, modeling how sentences interact within texts to produce overall meaning, with a quantum-inspired formalism adaptable to quantum computing implementations.
Contribution
It introduces DisCoCirc, a novel mathematical foundation for text-level meaning, building on and generalizing the DisCoCat model to include evolving word meanings within texts.
Findings
DisCoCirc models text as a string diagram of evolving word meanings.
The formalism is highly quantum-inspired and suitable for quantum computer implementation.
It generalizes previous models to handle dynamic, context-dependent meanings.
Abstract
In previous work we gave a mathematical foundation, referred to as DisCoCat, for how words interact in a sentence in order to produce the meaning of that sentence. To do so, we exploited the perfect structural match of grammar and categories of meaning spaces. Here, we give a mathematical foundation, referred to as DisCoCirc, for how sentences interact in texts in order to produce the meaning of that text. First we revisit DisCoCat. While in DisCoCat all meanings are fixed as states (i.e. have no input), in DisCoCirc word meanings correspond to a type, or system, and the states of this system can evolve. Sentences are gates within a circuit which update the variable meanings of those words. Like in DisCoCat, word meanings can live in a variety of spaces e.g. propositional, vectorial, or cognitive. The compositional structure are string diagrams representing information flows, and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
