Khmer Semantic Search Engine (KSE): Digital Information Access and Document Retrieval
Nimol Thuon

TL;DR
This paper introduces the first Khmer Semantic Search Engine (KSE) that significantly improves document retrieval accuracy for Khmer content by employing semantic matching techniques and formal annotations.
Contribution
The paper presents novel semantic search frameworks tailored for Khmer language, including keyword dictionary, ontology-based, and ranking methods, along with data preparation tools.
Findings
Semantic understanding improves search accuracy.
Three frameworks enhance Khmer search capabilities.
Ground truth dataset validates performance improvements.
Abstract
The search engine process is crucial for document content retrieval. For Khmer documents, an effective tool is needed to extract essential keywords and facilitate accurate searches. Despite the daily generation of significant Khmer content, Cambodians struggle to find necessary documents due to the lack of an effective semantic searching tool. Even Google does not deliver high accuracy for Khmer content. Semantic search engines improve search results by employing advanced algorithms to understand various content types. With the rise in Khmer digital content such as reports, articles, and social media feedback enhanced search capabilities are essential. This research proposes the first Khmer Semantic Search Engine (KSE), designed to enhance traditional Khmer search methods. Utilizing semantic matching techniques and formally annotated semantic content, our tool extracts meaningful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Web Data Mining and Analysis · Data Quality and Management
