PandaDB: Understanding Unstructured Data in Graph Database
Zihao Zhao, Zhihong Shen, Mingjie Tang, Chuan Hu, Huajin Wang,, Yuanchun Zhou

TL;DR
PandaDB is an open-source graph database designed to unify the management and querying of both structured and unstructured data, enabling semantic queries and efficient processing in industrial applications.
Contribution
This work introduces a novel graph data model, query language, and optimization techniques for integrated structured and unstructured data management in a single database system.
Findings
Supports large-scale unstructured data queries efficiently
Improves query processing speed through new optimization techniques
Widely applicable in industrial domains like FinTech and recommendation systems
Abstract
Unstructured data(e.g., images, videos, PDF files, etc.) contain semantic information, for example, the facial feature of a person and the plate number of a vehicle. There could be semantic relationships between data items which are not explicitly represented. For example, a person's face may appear in two irrelevant photos. Also, much information is represented as structured data(e.g., the person's name and age). End-users prefer to query the semantic information from unstructured data together with structured data based on the potential relationships among them. However, due to the lack of a unified database system for structured and unstructured data, developers have to comprise multiple systems and runtime together to answer these queries. In this work, we build an open-source graph database named PandaDB to consistently manage and query structured and unstructured data. We first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Image and Video Retrieval Techniques · Advanced Graph Neural Networks
