Toward Agentic RAG for Ukrainian

Marta Sumyk; Oleksandr Kosovan

arXiv:2604.14896·cs.AI·April 17, 2026

Toward Agentic RAG for Ukrainian

Marta Sumyk, Oleksandr Kosovan

PDF

TL;DR

This paper investigates agentic retrieval-augmented generation for Ukrainian, highlighting retrieval quality as a key bottleneck and proposing a system combining two-stage retrieval with an agentic layer.

Contribution

It introduces a novel system combining two-stage retrieval with an agentic layer for Ukrainian, analyzing its limitations and potential improvements.

Findings

01

Retrieval quality is the main bottleneck for system performance.

02

Agentic retry mechanisms improve answer accuracy.

03

Overall scores are limited by document and page identification.

Abstract

We present an initial investigation into Agentic Retrieval-Augmented Generation (RAG) for Ukrainian, conducted within the UNLP 2026 Shared Task on Multi-Domain Document Understanding. Our system combines two-stage retrieval (BGE-M3 with BGE reranking) with a lightweight agentic layer performing query rephrasing and answer-retry loops on top of Qwen2.5-3B-Instruct. Our analysis reveals that retrieval quality is the primary bottleneck: agentic retry mechanisms improve answer accuracy but the overall score remains constrained by document and page identification. We discuss practical limitations of offline agentic pipelines and outline directions for combining stronger retrieval with more advanced agentic reasoning for Ukrainian.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.