TL;DR
This paper introduces OpenTag, a deep learning model for extracting new attribute values from product profiles, utilizing active learning to minimize annotation effort and outperform existing methods.
Contribution
OpenTag is a novel sequence tagging model that leverages RNNs and CRFs, with an attention mechanism and active learning, to discover unseen attribute values with limited supervision.
Findings
Achieves 83% F-score in real-world datasets
Reduces annotation effort by 3.3 times
Outperforms state-of-the-art models
Abstract
Extraction of missing attribute values is to find values describing an attribute of interest from a free text input. Most past related work on extraction of missing attribute values work with a closed world assumption with the possible set of values known beforehand, or use dictionaries of values and hand-crafted features. How can we discover new attribute values that we have never seen before? Can we do this with limited human annotation or supervision? We study this problem in the context of product catalogs that often have missing values for many attributes of interest. In this work, we leverage product profile information such as titles and descriptions to discover missing values of product attributes. We develop a novel deep tagging model OpenTag for this extraction problem with the following contributions: (1) we formalize the problem as a sequence tagging task, and propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
