Logits-Constrained Framework with RoBERTa for Ancient Chinese NER

Wenjie Hua; Shenghan Xu

arXiv:2505.02983·cs.CL·May 7, 2025

Logits-Constrained Framework with RoBERTa for Ancient Chinese NER

Wenjie Hua, Shenghan Xu

PDF

Open Access

TL;DR

This paper introduces a Logits-Constrained framework using RoBERTa for Ancient Chinese NER, enhancing label transition validity and outperforming traditional methods on the EvaHan 2025 benchmark.

Contribution

It proposes a novel two-stage LC framework with a differentiable decoder and a model selection criterion tailored for Ancient Chinese NER tasks.

Findings

01

LC outperforms CRF and BiLSTM models

02

Effective in high-label and large-data scenarios

03

Provides practical model selection guidance

Abstract

This paper presents a Logits-Constrained (LC) framework for Ancient Chinese Named Entity Recognition (NER), evaluated on the EvaHan 2025 benchmark. Our two-stage model integrates GujiRoBERTa for contextual encoding and a differentiable decoding mechanism to enforce valid BMES label transitions. Experiments demonstrate that LC improves performance over traditional CRF and BiLSTM-based approaches, especially in high-label or large-data settings. We also propose a model selection criterion balancing label complexity and dataset size, providing practical guidance for real-world Ancient Chinese NLP tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsConditional Random Field