The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder
Ihor Stepanov, Mykhailo Shtopko, Dmytro Vodianytskyi, Oleksandr Lukashov

TL;DR
This paper presents GLiNER-bi-Encoder, a scalable architecture for NER that efficiently recognizes millions of entity types with state-of-the-art zero-shot performance and significantly improved throughput.
Contribution
It introduces a bi-encoder design for NER that overcomes quadratic complexity, enabling industrial-scale recognition of vast label sets with minimal overhead.
Findings
Achieves 61.5% Micro-F1 on CrossNER benchmark.
Up to 130x throughput improvement with pre-computed label embeddings.
Enables high-performance entity linking with GLiNKER framework.
Abstract
This paper introduces GLiNER-bi-Encoder, a novel architecture for Named Entity Recognition (NER) that harmonizes zero-shot flexibility with industrial-scale efficiency. While the original GLiNER framework offers strong generalization, its joint-encoding approach suffers from quadratic complexity as the number of entity labels increases. Our proposed bi-encoder design decouples the process into a dedicated label encoder and a context encoder, effectively removing the context-window bottleneck. This architecture enables the simultaneous recognition of thousands, and potentially millions, of entity types with minimal overhead. Experimental results demonstrate state-of-the-art zero-shot performance, achieving 61.5 percent Micro-F1 on the CrossNER benchmark. Crucially, by leveraging pre-computed label embeddings, GLiNER-bi-Encoder achieves up to a 130 times throughput improvement at 1024…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗knowledgator/gliner-bi-edge-v2.0model· 84 dl· ♡ 884 dl♡ 8
- 🤗knowledgator/gliner-bi-small-v2.0model· 26 dl· ♡ 426 dl♡ 4
- 🤗knowledgator/gliner-bi-base-v2.0model· 183 dl· ♡ 5183 dl♡ 5
- 🤗knowledgator/gliner-bi-large-v2.0model· 129 dl· ♡ 16129 dl♡ 16
- 🤗knowledgator/gliner-linker-base-v1.0model· 36 dl· ♡ 536 dl♡ 5
- 🤗knowledgator/gliner-linker-large-v1.0model· 109 dl· ♡ 8109 dl♡ 8
- 🤗knowledgator/gliner-linker-rerank-v1.0model· 17 dl· ♡ 517 dl♡ 5
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Text and Document Classification Technologies
