The Faiss library

Matthijs Douze; Alexandr Guzhva; Chengqi Deng; Jeff Johnson; Gergely Szilvasy; Pierre-Emmanuel Mazar\'e; Maria Lomeli; Lucas Hosseini; Herv\'e J\'egou

arXiv:2401.08281·cs.LG·October 24, 2025·62 cites

The Faiss library

Matthijs Douze, Alexandr Guzhva, Chengqi Deng, Jeff Johnson, Gergely Szilvasy, Pierre-Emmanuel Mazar\'e, Maria Lomeli, Lucas Hosseini, Herv\'e J\'egou

PDF

Open Access 1 Repo 1 Models 2 Datasets

TL;DR

The Faiss library provides efficient indexing and search methods for large-scale vector similarity search, addressing the growing need for managing extensive embedding collections in AI applications.

Contribution

It introduces a comprehensive toolkit of indexing methods and design principles tailored for scalable vector similarity search in large datasets.

Findings

01

Benchmark results demonstrate high performance and scalability.

02

Faiss supports diverse applications in AI and data analysis.

03

The library offers flexible trade-offs between speed and accuracy.

Abstract

Vector databases typically manage large collections of embedding vectors. Currently, AI applications are growing rapidly, and so is the number of embeddings that need to be stored and indexed. The Faiss library is dedicated to vector similarity search, a core functionality of vector databases. Faiss is a toolkit of indexing methods and related primitives used to search, cluster, compress and transform vectors. This paper describes the trade-off space of vector search and the design principles of Faiss in terms of structure, approach to optimization and interfacing. We benchmark key features of the library and discuss a few selected applications to highlight its broad applicability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/faiss
pytorchOfficial

Models

🤗
imageomics/bioclip-image-search-lite
model· ♡ 2
♡ 2

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Constraint Satisfaction and Optimization · Advanced Database Systems and Queries

MethodsLib