PREDICT: Persian Reverse Dictionary
Arman Malekzadeh, Amin Gheibi, Ali Mohades

TL;DR
This paper develops and evaluates the first Persian reverse dictionary, PREDICT, using neural network models, and introduces a new metric to measure synonym accuracy, achieving over 62% top-100 suggestion accuracy.
Contribution
It presents the first Persian reverse dictionary and compares four architectures, highlighting an LSTM with attention as the most effective model, along with a new evaluation metric.
Findings
LSTM with attention performs best among tested models.
The model achieves over 62% accuracy within top 100 suggestions.
A new synonym accuracy metric effectively evaluates reverse dictionary performance.
Abstract
Finding the appropriate words to convey concepts (i.e., lexical access) is essential for effective communication. Reverse dictionaries fulfill this need by helping individuals to find the word(s) which could relate to a specific concept or idea. To the best of our knowledge, this resource has not been available for the Persian language. In this paper, we compare four different architectures for implementing a Persian reverse dictionary (PREDICT). We evaluate our models using (phrase,word) tuples extracted from the only Persian dictionaries available online, namely Amid, Moein, and Dehkhoda where the phrase describes the word. Given the phrase, a model suggests the most relevant word(s) in terms of the ability to convey the concept. The model is considered to perform well if the correct word is one of its top suggestions. Our experiments show that a model consisting of Long…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques
MethodsTanh Activation
