Neural Databases
James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri,, Sebastian Riedel, Alon Halevy

TL;DR
This paper introduces NeuralDB, a schema-less database system that processes natural language queries and updates using advanced NLP models, enabling flexible and accurate data retrieval without predefined schemas.
Contribution
It presents a novel neural database architecture that operates without pre-defined schemas, utilizing parallel Neural SPJ operators and a trainable fact creation algorithm.
Findings
High accuracy in answering queries over thousands of sentences
Neural SPJ operators can be trained to create relevant fact sets
System can handle select-project-join and aggregation queries
Abstract
In recent years, neural networks have shown impressive performance gains on long-standing AI problems, and in particular, answering queries from natural language text. These advances raise the question of whether they can be extended to a point where we can relax the fundamental assumption of database management, namely, that our data is represented as fields of a pre-defined schema. This paper presents a first step in answering that question. We describe NeuralDB, a database system with no pre-defined schema, in which updates and queries are given in natural language. We develop query processing techniques that build on the primitives offered by the state of the art Natural Language Processing methods. We begin by demonstrating that at the core, recent NLP transformers, powered by pre-trained language models, can answer select-project-join queries if they are given the exact set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
