SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Junjie Wu; Jiangnan Li; Yuqing Li; Lemao Liu; Liyan Xu; Jiwei Li; Dit-Yan Yeung; Jie Zhou; Mo Yu

arXiv:2508.01959·cs.CL·April 22, 2026

SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Junjie Wu, Jiangnan Li, Yuqing Li, Lemao Liu, Liyan Xu, Jiwei Li, Dit-Yan Yeung, Jie Zhou, Mo Yu

PDF

TL;DR

SitEmb-v1.5 introduces a context-conditioned embedding approach that significantly improves retrieval accuracy for long documents and stories, outperforming larger models on a new benchmark.

Contribution

The paper presents a novel training paradigm and situated embedding models (SitEmb) that enhance retrieval by encoding chunks within their broader context.

Findings

01

SitEmb-v1 outperforms state-of-the-art models on a book-plot retrieval dataset.

02

SitEmb-v1.5 improves performance by over 10% across multiple languages.

03

The approach is effective for downstream applications requiring localized evidence.

Abstract

Retrieval-augmented generation (RAG) over long documents typically involves splitting the text into smaller chunks, which serve as the basic units for retrieval. However, due to dependencies across the original document, contextual information is often essential for accurately interpreting each chunk. To address this, prior work has explored encoding longer context windows to produce embeddings for longer chunks. Despite these efforts, gains in retrieval and downstream tasks remain limited. This is because (1) longer chunks strain the capacity of embedding models due to the increased amount of information they must encode, and (2) many real-world applications still require returning localized evidence due to constraints on model or human bandwidth. We propose an alternative approach to this challenge by representing short chunks in a way that is conditioned on a broader context window…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.