Analysis of Japanese Compound Nouns using Collocational Information

Kobayasi Yosiyuki; Takunaga Takenobu; Tanaka Hozumi

arXiv:cmp-lg/9412008·cmp-lg·February 3, 2008·3 cites

Analysis of Japanese Compound Nouns using Collocational Information

Kobayasi Yosiyuki, Takunaga Takenobu, Tanaka Hozumi

PDF

Open Access

TL;DR

This paper presents a method for analyzing Japanese compound nouns by leveraging collocational statistics and a thesaurus, achieving around 80% accuracy in a large-scale experiment.

Contribution

It introduces a novel approach combining collocational data and thesaurus information for Japanese compound noun analysis.

Findings

01

Achieved approximately 80% accuracy in compound noun analysis.

02

Utilized 160,000 word collocations for the experiment.

03

Analyzed compound nouns with an average length of 4.9 characters.

Abstract

Analyzing compound nouns is one of the crucial issues for natural language processing systems, in particular for those systems that aim at a wide coverage of domains. In this paper, we propose a method to analyze structures of Japanese compound nouns by using both word collocations statistics and a thesaurus. An experiment is conducted with 160,000 word collocations to analyze compound nouns of with an average length of 4.9 characters. The accuracy of this method is about 80%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Lexicography and Language Studies · Second Language Acquisition and Learning