Vector Search for the Future: From Memory-Resident, Static Heterogeneous Storage, to Cloud-Native Architectures
Yitong Song, Xuanhe Zhou, Christian S. Jensen, Jianliang Xu

TL;DR
This paper reviews the evolution of vector search architectures from in-memory to cloud-native, highlighting how storage innovations address scalability, cost, and performance challenges in large-scale multimodal data retrieval.
Contribution
It provides a systematic overview of vector search techniques across different storage architectures and discusses future directions for cloud-native, trillion-scale retrieval systems.
Findings
Memory-resident VS methods are fast but limited in scalability.
Heterogeneous storage architectures improve scalability and cost-efficiency.
Emerging cloud-native systems enable seamless scalability for trillion-scale data.
Abstract
Vector search (VS) has become a fundamental component in multimodal data management, enabling core functionalities such as image, video, and code retrieval. As vector data scales rapidly, VS faces growing challenges in balancing search, latency, scalability, and cost. The evolution of VS has been closely driven by changes in storage architecture. Early VS methods rely on all-in-memory designs for low latency, but scalability is constrained by memory capacity and cost. To address this, recent research has adopted heterogeneous architectures that offload space-intensive vectors and index structures to SSDs, while exploiting block locality and I/O-efficient strategies to maintain high search performance at billion scale. Looking ahead, the increasing demand for trillion-scale vector retrieval and cloud-native elasticity is driving a further shift toward memory-SSD-object storage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Caching and Content Delivery · Advanced Data Storage Technologies
