Leveraging Translation For Optimal Recall: Tailoring LLM Personalization With User Profiles
Karthik Ravichandran, Sarmistha Sarna Gomasta

TL;DR
This paper introduces a personalized cross-language information retrieval method that enhances recall by combining multi-level translation, semantic expansion, and user profiles, leading to improved retrieval performance on news and social media datasets.
Contribution
The paper presents a novel iterative query refinement technique that integrates translation, semantic embedding, and user profiles for personalized CLIR, outperforming baseline methods.
Findings
Superior performance over BM25 baseline in ROUGE metrics
Maintained semantic accuracy through multi-step translation process
Effective personalization using user profiles in cross-language retrieval
Abstract
This paper explores a novel technique for improving recall in cross-language information retrieval (CLIR) systems using iterative query refinement grounded in the user's lexical-semantic space. The proposed methodology combines multi-level translation, semantic embedding-based expansion, and user profile-centered augmentation to address the challenge of matching variance between user queries and relevant documents. Through an initial BM25 retrieval, translation into intermediate languages, embedding lookup of similar terms, and iterative re-ranking, the technique aims to expand the scope of potentially relevant results personalized to the individual user. Comparative experiments on news and Twitter datasets demonstrate superior performance over baseline BM25 ranking for the proposed approach across ROUGE metrics. The translation methodology also showed maintained semantic accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Semantic Web and Ontologies · Digital Rights Management and Security
