An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation

Mykola Trokhymovych; Yana Oliinyk; Nazarii Nyzhnyk

arXiv:2604.22095·cs.CL·April 27, 2026

An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation

Mykola Trokhymovych, Yana Oliinyk, Nazarii Nyzhnyk

PDF

TL;DR

This paper introduces an efficient Ukrainian RAG system with a custom search pipeline and lightweight model, enabling high-quality question answering on limited hardware, achieving 2nd place in a shared task.

Contribution

It presents a novel end-to-end Ukrainian RAG architecture with optimized search and model compression for local deployment.

Findings

01

Achieved 2nd place in the UNLP 2026 Shared Task.

02

Demonstrated high-quality QA on resource-constrained hardware.

03

Developed a specialized Ukrainian language model fine-tuned on synthetic data.

Abstract

This paper presents a highly efficient Retrieval-Augmented Generation (RAG) system built specifically for Ukrainian document question answering, which achieved 2nd place in the UNLP 2026 Shared Task. Our solution features a custom two-stage search pipeline that retrieves relevant document pages, paired with a specialized Ukrainian language model fine-tuned on synthetic data to generate accurate, grounded answers. Finally, we compress the model for lightweight deployment. Evaluated under strict computational limits, our architecture demonstrates that high-quality, verifiable AI question answering can be achieved locally on resource-constrained hardware without sacrificing accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.