Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Wenjie Hua, Shenghan Xu

TL;DR
This paper introduces a Logits-Constrained framework using RoBERTa for Ancient Chinese NER, enhancing label transition validity and outperforming traditional methods on the EvaHan 2025 benchmark.
Contribution
It proposes a novel two-stage LC framework with a differentiable decoder and a model selection criterion tailored for Ancient Chinese NER tasks.
Findings
LC outperforms CRF and BiLSTM models
Effective in high-label and large-data scenarios
Provides practical model selection guidance
Abstract
This paper presents a Logits-Constrained (LC) framework for Ancient Chinese Named Entity Recognition (NER), evaluated on the EvaHan 2025 benchmark. Our two-stage model integrates GujiRoBERTa for contextual encoding and a differentiable decoding mechanism to enforce valid BMES label transitions. Experiments demonstrate that LC improves performance over traditional CRF and BiLSTM-based approaches, especially in high-label or large-data settings. We also propose a model selection criterion balancing label complexity and dataset size, providing practical guidance for real-world Ancient Chinese NLP tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsConditional Random Field
