Sparse Named Entity Classification using Factorization Machines

Ai Hirata; Mamoru Komachi

arXiv:1703.04879·cs.CL·March 16, 2017·2 cites

Sparse Named Entity Classification using Factorization Machines

Ai Hirata, Mamoru Komachi

PDF

Open Access

TL;DR

This paper introduces a matrix factorization approach for named entity classification that effectively handles data sparsity, achieving competitive accuracy with fewer features.

Contribution

It proposes a novel application of factorization machines to improve named entity classification under sparse data conditions.

Findings

01

Achieves competitive accuracy with fewer features.

02

Outperforms traditional models on sparse data.

03

Demonstrates effectiveness of matrix factorization in NLP.

Abstract

Named entity classification is the task of classifying text-based elements into various categories, including places, names, dates, times, and monetary values. A bottleneck in named entity classification, however, is the data problem of sparseness, because new named entities continually emerge, making it rather difficult to maintain a dictionary for named entity classification. Thus, in this paper, we address the problem of named entity classification using matrix factorization to overcome the problem of feature sparsity. Experimental results show that our proposed model, with fewer features and a smaller size, achieves competitive accuracy to state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Web Data Mining and Analysis · Algorithms and Data Compression