Analysis of Japanese Compound Nouns using Collocational Information
Kobayasi Yosiyuki, Takunaga Takenobu, Tanaka Hozumi

TL;DR
This paper presents a method for analyzing Japanese compound nouns by leveraging collocational statistics and a thesaurus, achieving around 80% accuracy in a large-scale experiment.
Contribution
It introduces a novel approach combining collocational data and thesaurus information for Japanese compound noun analysis.
Findings
Achieved approximately 80% accuracy in compound noun analysis.
Utilized 160,000 word collocations for the experiment.
Analyzed compound nouns with an average length of 4.9 characters.
Abstract
Analyzing compound nouns is one of the crucial issues for natural language processing systems, in particular for those systems that aim at a wide coverage of domains. In this paper, we propose a method to analyze structures of Japanese compound nouns by using both word collocations statistics and a thesaurus. An experiment is conducted with 160,000 word collocations to analyze compound nouns of with an average length of 4.9 characters. The accuracy of this method is about 80%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Lexicography and Language Studies · Second Language Acquisition and Learning
