Named Entity Recognition on Noisy Data using Images and Text (1-page   abstract)

Diego Esteves

arXiv:1809.01964·cs.IR·September 7, 2018

Named Entity Recognition on Noisy Data using Images and Text (1-page abstract)

Diego Esteves

PDF

Open Access

TL;DR

This paper introduces a novel multi-level architecture for Named Entity Recognition on noisy Twitter data that leverages both images and text features, avoiding reliance on hand-crafted features or domain-specific rules.

Contribution

The proposed approach uniquely combines image and text features for NER, demonstrating competitive results without using linguistic resources or encoded rules.

Findings

01

Achieved 0.59 F-measure on Ritter dataset

02

Outperformed traditional methods on noisy Twitter data

03

Showed potential for improved NER in complex contexts

Abstract

Named Entity Recognition (NER) is an important subtask of information extraction that seeks to locate and recognise named entities. Despite recent achievements, we still face limitations in correctly detecting and classifying entities, prominently in short and noisy text, such as Twitter. An important negative aspect in most of NER approaches is the high dependency on hand-crafted features and domain-specific knowledge, necessary to achieve state-of-the-art results. Thus, devising models to deal with such linguistically complex contexts is still challenging. In this paper, we propose a novel multi-level architecture that does not rely on any specific linguistic resource or encoded rule. Unlike traditional approaches, we use features extracted from images and text to classify named entities. Experimental tests against state-of-the-art NER for Twitter on the Ritter dataset present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Web Data Mining and Analysis