Bilingual Terminology Extraction Using Multi-level Termhood
Chengzhi Zhang, Dan Wu

TL;DR
This paper introduces a multi-level termhood approach for improved monolingual terminology extraction and bilingual term alignment, enhancing performance in cross-language applications like machine translation and information retrieval.
Contribution
It proposes a novel multi-level termhood method that considers term and sentence distributions to improve terminology extraction and bilingual alignment accuracy.
Findings
Multi-level termhood outperforms existing methods in terminology extraction.
Using termhood as a constraint improves bilingual term alignment.
Experimental results demonstrate enhanced performance in both tasks.
Abstract
Purpose: Terminology is the set of technical words or expressions used in specific contexts, which denotes the core concept in a formal discipline and is usually applied in the fields of machine translation, information retrieval, information extraction and text categorization, etc. Bilingual terminology extraction plays an important role in the application of bilingual dictionary compilation, bilingual Ontology construction, machine translation and cross-language information retrieval etc. This paper addresses the issues of monolingual terminology extraction and bilingual term alignment based on multi-level termhood. Design/methodology/approach: A method based on multi-level termhood is proposed. The new method computes the termhood of the terminology candidate as well as the sentence that includes the terminology by the comparison of the corpus. Since terminologies and general words…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
