OpenHowNet: An Open Sememe-based Lexical Knowledge Base
Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Qiang Dong, Maosong Sun,, Zhendong Dong

TL;DR
OpenHowNet is an open lexical knowledge base built on sememes, providing extensive sense annotations, online tools, and APIs to facilitate research and applications in semantic understanding.
Contribution
It introduces OpenHowNet, an open, sememe-based lexical resource with core data, web interface, and APIs, expanding access and usability of sememe annotations for NLP research.
Findings
Contains over 100,000 senses with sememe annotations
Provides online tools and APIs for accessing and visualizing data
Facilitates research in sememe-based semantic analysis
Abstract
In this paper, we present an open sememe-based lexical knowledge base OpenHowNet. Based on well-known HowNet, OpenHowNet comprises three components: core data which is composed of more than 100 thousand senses annotated with sememes, OpenHowNet Web which gives a brief introduction to OpenHowNet as well as provides online exhibition of OpenHowNet information, and OpenHowNet API which includes several useful APIs such as accessing OpenHowNet core data and drawing sememe tree structures of senses. In the main text, we first give some backgrounds including definition of sememe and details of HowNet. And then we introduce some previous HowNet and sememe-based research works. Last but not least, we detail the constituents of OpenHowNet and their basic features and functionalities. Additionally, we briefly make a summary and list some future works.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Advanced Text Analysis Techniques · Topic Modeling
