A Quantum Many-body Wave Function Inspired Language Modeling Approach

Peng Zhang; Zhan Su; Lipeng Zhang; Benyou Wang; Dawei Song

arXiv:1808.09891·cs.CL·September 5, 2018

A Quantum Many-body Wave Function Inspired Language Modeling Approach

Peng Zhang, Zhan Su, Lipeng Zhang, Benyou Wang, Dawei Song

PDF

1 Repo

TL;DR

This paper introduces a quantum many-body wave function inspired language model that models complex word interactions and integrates CNNs, showing improved performance on question answering datasets over existing quantum-inspired and CNN-based models.

Contribution

It proposes a novel QMWF-inspired language modeling approach that captures multi-meaning word interactions and provides a theoretical foundation for CNN integration.

Findings

01

Outperforms state-of-the-art quantum-inspired LMs and CNN-based methods on QA datasets.

02

Models interactions among words using tensor product.

03

Reveals the necessity of CNNs in QMWF language modeling.

Abstract

The recently proposed quantum language model (QLM) aimed at a principled approach to modeling term dependency by applying the quantum probability theory. The latest development for a more effective QLM has adopted word embeddings as a kind of global dependency information and integrated the quantum-inspired idea in a neural network architecture. While these quantum-inspired LMs are theoretically more general and also practically effective, they have two major limitations. First, they have not taken into account the interaction among words with multiple meanings, which is common and important in understanding natural language text. Second, the integration of the quantum-inspired LM with the neural network was mainly for effective training of parameters, yet lacking a theoretical foundation accounting for such integration. To address these two issues, in this paper, we propose a Quantum…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TJUIRLAB/CIKM2018_QMWFLM
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.