AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types

Xin Luna Dong; Xiang He; Andrey Kan; Xian Li; Yan Liang; Jun Ma; Yifan Ethan Xu; Chenwei Zhang; Tong Zhao; Gabriel Blanco Saldana; Saurabh Deshpande; Alexandre Michetti Manduca; Jay Ren; Surender Pal Singh; Fan Xiao; Haw-Shiuan Chang; Giannis Karamanolakis; Yuning Mao; Yaqing Wang; Christos Faloutsos; Andrew McCallum; Jiawei Han

arXiv:2006.13473·cs.AI·June 25, 2020

TL;DR

AutoKnow is an automated system that constructs comprehensive product knowledge graphs across thousands of categories by leveraging novel techniques and customer behavior data, addressing challenges like data sparsity and heterogeneity.

Contribution

The paper introduces AutoKnow, a scalable, automatic system with new methods for taxonomy construction, product property identification, and anomaly detection in product knowledge graphs.

Findings

01 Operated over 11,000 product types

02 Effectively handles data sparsity and heterogeneity

03 Integrates customer behavior logs for enhanced knowledge extraction

Abstract

Can one build a knowledge graph (KG) for all products in the world? Knowledge graphs have firmly established themselves as valuable sources of information for search and question answering, and it is natural to wonder if a KG can contain information about products offered at online retail sites. There have been several successful examples of generic KGs, but organizing information about products poses many additional challenges, including sparsity and noise of structured data for products, complexity of the domain with millions of product types and thousands of attributes, heterogeneity across large number of categories, as well as large and constantly growing number of products. We describe AutoKnow, our automatic (self-driving) system that addresses these challenges. The system includes a suite of novel techniques for taxonomy construction, product property identification, knowledge…
