vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

Jayson Steffens

arXiv:2604.15484·cs.IR·April 20, 2026

vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

Jayson Steffens

PDF

1 Repo 4 Models

TL;DR

Vstash is a hybrid document retrieval system combining vector similarity and keyword matching, with adaptive fusion and self-supervised embedding refinement, achieving improved retrieval performance and efficiency.

Contribution

It introduces a self-supervised training method for hybrid retrieval, adaptive reciprocal rank fusion with per-query IDF weighting, and a production-grade, open-source implementation.

Findings

01

Self-supervised disagreement-based embedding refinement improves NDCG@10.

02

Adaptive RRF with IDF weighting outperforms fixed weights in retrieval accuracy.

03

The system achieves stable, low-latency retrieval with open-source code and validation on large datasets.

Abstract

We present **vstash**, a local-first document memory system that combines vector similarity search with full-text keyword matching via Reciprocal Rank Fusion (RRF) and adaptive per-query IDF weighting. All data resides in a single SQLite file using sqlite-vec for approximate nearest neighbor search and FTS5 for keyword matching. We make four primary contributions. **(1)** Self-supervised embedding refinement via hybrid retrieval disagreement: across 753 BEIR queries on SciFact, NFCorpus, and FiQA, 74.5% produce top-10 disagreement between vector-heavy (vec=0.95, fts=0.05) and FTS-heavy (vec=0.05, fts=0.95) search (per-dataset rates 63.4% / 73.4% / 86.7%, Section 5.2), providing a free training signal without human labels. Fine-tuning BGE-small (33M params) with MultipleNegativesRankingLoss on 76K disagreement triples improves NDCG@10 on all 5 BEIR datasets (up to +19.5% on NFCorpus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://huggingface.co/Stffens/bge-small-rrf-v2
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.