Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks
Wei-Jie Huang, Chao-Lin Liu

TL;DR
This paper investigates the use of lexical, syntactic, and semantic features for Chinese textual entailment, employing heuristics and machine learning, achieving robust results in NTCIR RITE tasks.
Contribution
It introduces a comprehensive feature set and compares heuristics-based and machine learning approaches for Chinese textual entailment.
Findings
Achieved second place in NTCIR-10 RITE-2 binary classification.
Demonstrated robustness across traditional and simplified Chinese.
Extended analysis of feature contributions and classifier configurations.
Abstract
We computed linguistic information at the lexical, syntactic, and semantic levels for Recognizing Inference in Text (RITE) tasks for both traditional and simplified Chinese in NTCIR-9 and NTCIR-10. Techniques for syntactic parsing, named-entity recognition, and near synonym recognition were employed, and features like counts of common words, statement lengths, negation words, and antonyms were considered to judge the entailment relationships of two statements, while we explored both heuristics-based functions and machine-learning approaches. The reported systems showed robustness by simultaneously achieving second positions in the binary-classification subtasks for both simplified and traditional Chinese in NTCIR-10 RITE-2. We conducted more experiments with the test data of NTCIR-9 RITE, with good results. We also extended our work to search for better configurations of our classifiers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
