TL;DR
This paper introduces COPER, a query-adaptable, semantics-based search engine for Persian COVID-19 articles, leveraging BERT and traditional models to improve information retrieval accuracy.
Contribution
It presents a novel Persian COVID-19 article search engine that combines BERT-based models with traditional keyword methods, optimized through query adaptation and fine-tuning.
Findings
The proposed search engine outperforms baseline models significantly.
Query adaptation improves retrieval relevance.
Fine-tuning with Semantic Textual Similarity enhances performance.
Abstract
With the surge of pretrained language models, a new pathway has been opened to incorporate Persian text contextual information. Meanwhile, as many other countries, including Iran, are fighting against COVID-19, a plethora of COVID-19 related articles has been published in Iranian Healthcare magazines to better inform the public of the situation. However, finding answers in this sheer volume of information is an extremely difficult task. In this paper, we collected a large dataset of these articles, leveraged different BERT variations as well as other keyword models such as BM25 and TF-IDF, and created a search engine to sift through these documents and rank them, given a user's query. Our final search engine consists of a ranker and a re-ranker, which adapts itself to the query. We fine-tune our models using Semantic Textual Similarity and evaluate them with standard task metrics. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Residual Connection · Dense Connections · Softmax · WordPiece
