Thistle: A Vector Database in Rust
Brad Windsor, Kevin Choi

TL;DR
Thistle is a fully functional vector database implemented in Rust, designed to enhance search query responses using latent knowledge, with benchmark results on the MS MARCO dataset.
Contribution
It introduces Thistle, a new vector database in Rust that incorporates well-known algorithms and benchmarks its performance on a major dataset.
Findings
Benchmark results on MS MARCO dataset
Clarifies the latent knowledge domain
Highlights Rust ML ecosystem growth
Abstract
We present Thistle, a fully functional vector database. Thistle is an entry into the domain of latent knowledge use in answering search queries, an ongoing research topic at both start-ups and search engine companies. We implement Thistle with several well-known algorithms, and benchmark results on the MS MARCO dataset. Results help clarify the latent knowledge domain as well as the growing Rust ML ecosystem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Algorithms · Information Retrieval and Search Behavior
