Disambiguation of Company names via Deep Recurrent Networks
Alessandro Basile, Riccardo Crupi, Michele Grasso, Alessandro, Mercanti, Daniele Regoli, Simone Scarsi, Shuyi Yang, Andrea Cosentini

TL;DR
This paper introduces a Siamese LSTM network for company name disambiguation, demonstrating improved accuracy over benchmarks and showing active learning reduces labeling effort while maintaining performance.
Contribution
It presents a novel deep recurrent network approach combined with active learning for efficient company name disambiguation.
Findings
Siamese LSTM outperforms standard string matching methods.
Active learning reduces labeling effort while maintaining accuracy.
Model reaches performance saturation with less labeled data.
Abstract
Name Entity Disambiguation is the Natural Language Processing task of identifying textual records corresponding to the same Named Entity, i.e. real-world entities represented as a list of attributes (names, places, organisations, etc.). In this work, we face the task of disambiguating companies on the basis of their written names. We propose a Siamese LSTM Network approach to extract -- via supervised learning -- an embedding of company name strings in a (relatively) low dimensional vector space and use this representation to identify pairs of company names that actually represent the same company (i.e. the same Entity). Given that the manual labelling of string pairs is a rather onerous task, we analyse how an Active Learning approach to prioritise the samples to be labelled leads to a more efficient overall learning pipeline. With empirical investigations, we show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Topic Modeling · Text and Document Classification Technologies
MethodsTanh Activation · Sigmoid Activation · Siamese Network · Long Short-Term Memory
