TANKER: Distributed Architecture for Named Entity Recognition and Disambiguation
Sandro A. Coelho, Diego Moussallem, Gustavo C. Publio, Diego, Esteves

TL;DR
TANKER is a distributed, micro-services-based architecture that standardizes interfaces and combines multiple NERD systems to improve scalability, reliability, and flexibility for industrial applications.
Contribution
It introduces a standardized API and a distributed architecture to integrate multiple NERD systems, addressing scalability and reliability challenges in industrial contexts.
Findings
Enables integration of multiple NERD systems seamlessly
Improves scalability and fault tolerance for large-scale NERD applications
Provides a standardized API for easier system interoperability
Abstract
Named Entity Recognition and Disambiguation (NERD) systems have recently been widely researched to deal with the significant growth of the Web. NERD systems are crucial for several Natural Language Processing (NLP) tasks such as summarization, understanding, and machine translation. However, there is no standard interface specification, i.e. these systems may vary significantly either for exporting their outputs or for processing the inputs. Thus, when a given company desires to implement more than one NERD system, the process is quite exhaustive and prone to failure. In addition, industrial solutions demand critical requirements, e.g., large-scale processing, completeness, versatility, and licenses. Commonly, these requirements impose a limitation, making good NERD models to be ignored by companies. This paper presents TANKER, a distributed architecture which aims to overcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsService-Oriented Architecture and Web Services · Topic Modeling · Natural Language Processing Techniques
