{"id":5462,"date":"2026-06-08T10:52:06","date_gmt":"2026-06-08T17:52:06","guid":{"rendered":"https:\/\/www.couchbase.com\/blog\/?p=5462"},"modified":"2026-06-08T10:52:07","modified_gmt":"2026-06-08T17:52:07","slug":"diskann","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/pt\/diskann\/","title":{"rendered":"What Is DiskANN? Billion-Scale Vector Search Explained"},"content":{"rendered":"<p>Retrieval-augmented generation (RAG), semantic search, and AI agents all depend on one thing: the ability to quickly find the most relevant vectors in a large dataset. As embedding datasets grow from millions to billions of records, purely in-memory vector indexes become financially unsustainable. DiskANN solves this problem by storing vector indexes on SSD rather than RAM, enabling web-scale search on commodity hardware.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-is-diskann\">What is DiskANN?<\/h2>\n\n\n\n<p>DiskANN<strong> <\/strong>is a graph-based vector search algorithm developed by Microsoft Research that was first published at NeurIPS 2019. It performs approximate nearest neighbor (ANN) search over billion-scale vector datasets, using SSD as its primary storage medium and keeping only a compressed representation of the index in RAM.<\/p>\n\n\n\n<p>Before DiskANN, algorithms like HNSW and FAISS required the entire vector index to reside in DRAM. At a billion vectors, that demands hundreds of gigabytes of RAM and an infrastructure that\u2019s expensive to provision and scale. DiskANN breaks this constraint by shifting the bulk of the index to disk while maintaining recall and latency characteristics competitive with in-memory approaches. It\u2019s built on top of the Vamana algorithm, a novel graph construction method that produces a single-layer directed graph well-suited for efficient disk-based traversal.<\/p>\n\n\n\n<p><strong>Key benchmarks from the original <\/strong><a href=\"https:\/\/papers.nips.cc\/paper\/2019\"><strong>NeurIPS 2019 paper<\/strong><\/a><strong>:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>1B+<\/strong> vectors indexed on a single machine with 64GB RAM<\/li>\n\n\n\n<li><strong>5-10x<\/strong> more vectors per machine vs. DRAM-only solutions at equivalent latency<\/li>\n\n\n\n<li><strong>&lt;5 ms<\/strong> query latency with 95%+ recall on the SIFT-1B benchmark<\/li>\n<\/ul>\n\n\n\n<p>DiskANN is now the basis of vector search infrastructure at Microsoft (used in Bing and Microsoft 365) and has been adopted by <a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/releases\/\">Couchbase<\/a>, Azure Cosmos DB, Azure Database for PostgreSQL, TimescaleDB&#8217;s pgvectorscale, and other databases.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-how-diskann-works\">How DiskANN works<\/h2>\n\n\n\n<p>DiskANN combines two techniques: the Vamana graph for index structure and navigation, and product quantization (PQ) for in-memory vector compression.<\/p>\n\n\n\n<p><strong>Building the Vamana graph:<\/strong> Vamana constructs a single-layer directed graph where each node represents a vector. It initializes with random connections, then refines through two-pass pruning. The first pass removes redundant short-range edges, and the second adds long-range edges that let the search algorithm jump quickly to the right region of the graph without expensive multi-hop traversal. Unlike HNSW&#8217;s multi-layer hierarchy, DiskANN\u2019s single-layer structure makes it practical on disk.<\/p>\n\n\n\n<p><strong>The RAM\/SSD split:<\/strong> PQ-compressed vectors (e.g., 32 bytes vs. 512 bytes for a 128-dim float32 vector) are cached in RAM for fast approximate distance calculations. The full Vamana graph index and full-precision vectors live on SSD and are only read for final reranking.<\/p>\n\n\n\n<p><strong>Query execution:<\/strong> Search runs in two phases. First, the algorithm uses PQ-compressed vectors in RAM to navigate the graph and identify a candidate set \u2013 no disk reads required. Then it fetches full-precision vectors from SSD for that candidate set and computes exact distances. This two-phase design is what preserves high recall while keeping RAM requirements low.<\/p>\n\n\n\n<p><strong>FreshDiskANN:<\/strong> The original Vamana implementation produces a static index. FreshDiskANN, a follow-on from Microsoft Research, extends DiskANN to support concurrent real-time inserts, deletes, and updates without full index rebuilds. It maintains over 95% recall, making it practical for streaming datasets such as recommender systems and live document repositories.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-diskann-vs-hnsw-vs-ivf\">DiskANN vs. HNSW vs. IVF<\/h2>\n\n\n\n<p>DiskANN, HNSW, and IVF are the three dominant ANN index types in production today.<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>DiskANN (Vamana)<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>HNSW<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>IVF<\/strong><\/td><\/tr><tr><td><strong>Primary storage<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">SSD + small RAM cache<\/td><td class=\"has-text-align-center\" data-align=\"center\">RAM (full index in memory)<\/td><td class=\"has-text-align-center\" data-align=\"center\">RAM or object storage<\/td><\/tr><tr><td><strong>Max practical scale<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Billions of vectors<\/td><td class=\"has-text-align-center\" data-align=\"center\">100-200M vectors (RAM-limited)<\/td><td class=\"has-text-align-center\" data-align=\"center\">Hundreds of millions<\/td><\/tr><tr><td><strong>Query latency<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Low (5-15 ms typical)<\/td><td class=\"has-text-align-center\" data-align=\"center\">Very low (1-5 ms)<\/td><td class=\"has-text-align-center\" data-align=\"center\">Low-medium<\/td><\/tr><tr><td><strong>Memory footprint<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Low (PQ-compressed in RAM)<\/td><td class=\"has-text-align-center\" data-align=\"center\">High (full vectors in DRAM)<\/td><td class=\"has-text-align-center\" data-align=\"center\">Medium<\/td><\/tr><tr><td><strong>Real-time updates<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Via FreshDiskANN<\/td><td class=\"has-text-align-center\" data-align=\"center\">Supported natively<\/td><td class=\"has-text-align-center\" data-align=\"center\">Expensive (rebuild)<\/td><\/tr><tr><td><strong>Filtered search<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Via Filtered-DiskANN<\/td><td class=\"has-text-align-center\" data-align=\"center\">Via post-filtering<\/td><td class=\"has-text-align-center\" data-align=\"center\">Strong (IVF variants)<\/td><\/tr><tr><td><strong>Best for<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">Billion-scale RAG, agents, recommendations on cost-effective hardware<\/td><td class=\"has-text-align-center\" data-align=\"center\">Smaller datasets where ultra-low latency is critical and RAM is available<\/td><td class=\"has-text-align-center\" data-align=\"center\">Filtered search with high filter ratios (&gt;85%)<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>If your dataset fits in RAM (under ~100M vectors), HNSW typically delivers the lowest latency. Once you&#8217;re indexing hundreds of millions of vectors, or when RAM cost is a constraint, DiskANN is the more practical choice. For workloads where 85%+ of the dataset is filtered out before search, IVF variants can outperform both.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Performance and benchmarks<\/h2>\n\n\n\n<p><strong>Original NeurIPS 2019 results (Microsoft Research):<\/strong> On the SIFT-1B dataset (1 billion 128-dimensional vectors), DiskANN achieved 5,000 QPS at 95%+ recall@1 with sub-5 ms average latency on a single machine with 64GB RAM and an NVMe SSD. That&#8217;s 5-10x more vectors per machine than DRAM-based solutions at equivalent performance.<\/p>\n\n\n\n<p><strong>Couchbase Hyperscale Vector Index benchmark (October 2025):<\/strong> Couchbase&#8217;s Vamana\/DiskANN-based Hyperscale Vector Index, introduced in Couchbase 8.0, was independently benchmarked using VectorDBBench against a 1 billion-vector dataset:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>700+ QPS<\/strong> with sub-second latency at 93% recall<\/li>\n\n\n\n<li><strong>350x faster<\/strong> than MongoDB Atlas, which returned 2 QPS with over 40 seconds of average latency at 89% recall under identical conditions<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">DiskANN use cases<\/h2>\n\n\n\n<p><strong>RAG:<\/strong> Enterprise RAG systems that span billions of document chunks need high-recall retrieval without RAM-heavy infrastructure. DiskANN is well-suited to workloads where prompt content is unpredictable and broad semantic coverage is essential.<\/p>\n\n\n\n<p><strong>AI agents and contextual memory:<\/strong> Agentic systems accumulate interaction history, preferences, and task context as vectors over time. DiskANN lets agents search an unbounded and growing memory corpus without RAM becoming a bottleneck.<\/p>\n\n\n\n<p><strong>Semantic search and recommendations:<\/strong> E-commerce, media, and enterprise search platforms operating over hundreds of millions to billions of items benefit from DiskANN&#8217;s throughput and recall accuracy, especially when combined with metadata prefiltering via Filtered-DiskANN.<\/p>\n\n\n\n<p><strong>Privacy-first and on-premises AI:<\/strong> When data cannot leave a controlled environment, DiskANN&#8217;s ability to run on local SSD hardware makes<a href=\"https:\/\/www.couchbase.com\/blog\/pt\/enhancing-genai-for-privacy-and-performance\/\"> privacy-preserving GenAI applications<\/a> more practical than approaches requiring cloud-hosted, RAM-intensive clusters.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">DiskANN in databases and platforms<\/h2>\n\n\n\n<p><strong>Couchbase Hyperscale Vector Index:<\/strong> Couchbase 8.0 introduced the Hyperscale Vector Index (HVI), a hybrid Vamana + IVF implementation available in<a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/capella\/\"> Couchbase Capella<\/a> and self-managed deployments. It operates across partitioned disks for distributed processing and is specifically designed for RAG workloads that require broad semantic coverage.<\/p>\n\n\n\n<p><strong>Microsoft\u2019s Azure Cosmos DB and Azure Database for PostgreSQL:<\/strong> Azure Cosmos DB uses DiskANN to power vector search in its NoSQL API. Azure Database for PostgreSQL offers it as a Vamana-based alternative to pgvector&#8217;s HNSW and IVFFlat.<\/p>\n\n\n\n<p><strong>Milvus and Zilliz:<\/strong> Milvus is an open-source vector database that supports DiskANN as an on-disk index type (DISKANN) for billion-scale collections with the Vamana graph on disk and PQ-compressed vectors in RAM. Zilliz Cloud is the fully managed enterprise version of Milvus.<\/p>\n\n\n\n<p><strong>pgvectorscale (TimescaleDB):<\/strong> pgvectorscale is a PostgreSQL extension developed by Timescale that implements StreamingDiskANN. It\u2019s optimized for continuously updated time-series and streaming datasets.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How to tune DiskANN<\/h2>\n\n\n\n<p>DiskANN&#8217;s key parameters govern the trade-off between recall latency and throughput:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>MaxDegree:<\/strong> Maximum out-edges per graph node. Higher values improve recall but increase index size and SSD reads. Default: 56.<\/li>\n\n\n\n<li><strong>SearchListSize:<\/strong> Candidate list size during search. Increase to 150-200 for high-recall workloads (RAG, agent memory); keep at 100 for maximum throughput. Default: 100.<\/li>\n\n\n\n<li><strong>PQCodeBudgetGBRatio:<\/strong> Fraction of dataset size to cache as PQ-compressed vectors in RAM. Increase to 0.2 if RAM headroom allows and latency is critical. Default: 0.125.<\/li>\n\n\n\n<li><strong>BeamWidthRatio:<\/strong> Parallel SSD reads per query step. Tune upward (6.0-8.0) to maximize QPS on high-throughput workloads. Default: 4.0.<\/li>\n<\/ul>\n\n\n\n<p>For hardware sizing, budget roughly 750GB-1TB of NVMe SSD for a 1B 128-dimensional float32 dataset (512GB for full-precision vectors + ~224GB for graph edges), and 64-128GB of RAM for the PQ cache. DiskANN is not CPU-bound; 8-16 cores are sufficient for most deployments.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Key takeaways<\/h2>\n\n\n\n<p>DiskANN addresses the fundamental economic problem of in-memory ANN indexing by using inexpensive SSDs rather than expensive RAM. By combining the Vamana graph construction algorithm with product quantization, it achieves recall and latency competitive with in-memory approaches at a fraction of the infrastructure cost.<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>DiskANN is a graph-based ANN algorithm from Microsoft Research (NeurIPS 2019) that is built on the Vamana directed graph construction algorithm.<\/li>\n\n\n\n<li>It stores the full index and full-precision vectors on SSD, caching only PQ-compressed vectors in RAM for fast approximate routing.<\/li>\n\n\n\n<li>It indexes 1B+ vectors on a single machine with 64GB RAM, achieving 95%+ recall@1 with sub-5 ms latency on the SIFT-1B benchmark.<\/li>\n\n\n\n<li>DiskANN indexes 5-10x more vectors per machine than DRAM-only algorithms at equivalent latency, directly reducing infrastructure cost.<\/li>\n\n\n\n<li>It&#8217;s the right choice when datasets exceed 100-200M vectors or when RAM cost is a constraint. HNSW is preferable for smaller latency-critical workloads.<\/li>\n\n\n\n<li>FreshDiskANN extends DiskANN to support real-time inserts, deletes, and updates without full index rebuilds.<\/li>\n\n\n\n<li>Couchbase&#8217;s Hyperscale Vector Index delivers 700+ QPS at 93% recall at billion-vector scale. This is 350x faster than MongoDB Atlas in independent VectorDBBench testing.<\/li>\n<\/ol>\n\n\n\n<p><strong>Related resources<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/couchbase-8-hyperscale-ai\/\">Couchbase 8.0: Unified Data Platform for Hyperscale AI Applications<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/docs.couchbase.com\/cloud\/vector-index\/hyperscale-vector-index.html\">Vector Search Using Hyperscale Vector Indexes<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/ai-services\/\">AI Services in Capella<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/vector-search\/\">Vector Search Database<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/rag-app-vector-ios\/\">How I Built a Plant RAG Application With Couchbase Vector Search on iOS<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/use-cases\/artificial-intelligence\/\">Artificial Intelligence (AI) Use Cases<\/a><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">FAQs<\/h2>\n\n\n\n<p><strong>What is the Vamana algorithm, and how does it differ from HNSW? <\/strong>Vamana builds a single-layer directed graph, while HNSW builds a multi-layer hierarchy. HNSW&#8217;s structure requires the entire index to be in RAM for efficient pointer traversal. Vamana&#8217;s single-layer design, with explicit long-range edges added during construction, enables the same fast navigation from disk without the RAM dependency.<\/p>\n\n\n\n<p><strong>How does product quantization work in DiskANN, and why is it necessary? <\/strong>PQ compresses each vector into a compact code (typically 16-32x smaller) by dividing it into sub-vectors and mapping each to a learned centroid. DiskANN stores these codes in RAM for fast approximate routing, then fetches full-precision vectors from SSD only for final reranking, keeping the in-memory footprint tractable even at billion-vector scale.<\/p>\n\n\n\n<p><strong>What are DiskANN&#8217;s limitations, and when is it not the best choice? <\/strong>DiskANN is less suitable for small datasets (under ~10M vectors), where HNSW offers lower latency with less overhead. It\u2019s also less suitable for workloads with very high filter ratios (85-98%), where IVF variants outperform graph-based indexes. Query latency is also highly sensitive to disk speed, and SATA SSDs will significantly underperform NVMe.<\/p>\n\n\n\n<p><strong>How do I estimate hardware requirements for a DiskANN deployment? <\/strong>For a 1B 128-dimensional float32 dataset, budget roughly 750GB-1TB of NVMe SSD (512GB for vectors plus ~224GB for graph edges) and 64-128GB of RAM for the PQ cache. DiskANN is I\/O-bound rather than CPU-bound, so 8-16 cores are sufficient for most production deployments.<strong>Which databases support DiskANN? <\/strong>DiskANN is available in Couchbase 8.0 (Hyperscale Vector Index, benchmarked at 700+ QPS at billion-vector scale), Azure Cosmos DB, Azure Database for PostgreSQL, Milvus\/Zilliz Cloud, and TimescaleDB&#8217;s pgvectorscale. Microsoft also uses DiskANN in Bing and Microsoft 365, making it the most widely deployed billion-scale vector search algorithm in enterprise infrastructure today.<\/p>","protected":false},"excerpt":{"rendered":"<p>Retrieval-augmented generation (RAG), semantic search, and AI agents all depend on one thing: the ability to quickly find the most relevant vectors in a large dataset. As embedding datasets grow from millions to billions of records, purely in-memory vector indexes become financially unsustainable. DiskANN solves this problem by storing vector indexes on SSD rather than [&hellip;]<\/p>\n","protected":false},"author":81637,"featured_media":5463,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[715],"tags":[],"ppma_author":[1022],"class_list":["post-5462","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-vector-search"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.6 (Yoast SEO v27.6) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>What Is DiskANN? Billion-Scale Vector Search Explained - The Couchbase Blog<\/title>\n<meta name=\"description\" content=\"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/pt\/diskann\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What Is DiskANN? Billion-Scale Vector Search Explained\" \/>\n<meta property=\"og:description\" content=\"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/pt\/diskann\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-08T17:52:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-06-08T17:52:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png\" \/>\n\t<meta property=\"og:image:width\" content=\"2400\" \/>\n\t<meta property=\"og:image:height\" content=\"1256\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Hannah Laurel\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Hannah Laurel\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/\"},\"author\":{\"name\":\"Hannah Laurel\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/d70b9304da33992d8663bf2933fa52cb\"},\"headline\":\"What Is DiskANN? Billion-Scale Vector Search Explained\",\"datePublished\":\"2026-06-08T17:52:06+00:00\",\"dateModified\":\"2026-06-08T17:52:07+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/\"},\"wordCount\":1748,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png\",\"articleSection\":[\"Vector Search\"],\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/\",\"name\":\"What Is DiskANN? Billion-Scale Vector Search Explained - The Couchbase Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png\",\"datePublished\":\"2026-06-08T17:52:06+00:00\",\"dateModified\":\"2026-06-08T17:52:07+00:00\",\"description\":\"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png\",\"width\":2400,\"height\":1256},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/diskann\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What Is DiskANN? Billion-Scale Vector Search Explained\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/logo.svg\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/5\\\/2026\\\/06\\\/logo.svg\",\"width\":\"1024\",\"height\":\"1024\",\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/d70b9304da33992d8663bf2933fa52cb\",\"name\":\"Hannah Laurel\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g83799598d1fc957e38a4e9f3226e010d\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g\",\"caption\":\"Hannah Laurel\"},\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/pt\\\/author\\\/hannah-laurel\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"What Is DiskANN? Billion-Scale Vector Search Explained - The Couchbase Blog","description":"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/pt\/diskann\/","og_locale":"pt_BR","og_type":"article","og_title":"What Is DiskANN? Billion-Scale Vector Search Explained","og_description":"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.","og_url":"https:\/\/www.couchbase.com\/blog\/pt\/diskann\/","og_site_name":"The Couchbase Blog","article_published_time":"2026-06-08T17:52:06+00:00","article_modified_time":"2026-06-08T17:52:07+00:00","og_image":[{"width":2400,"height":1256,"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png","type":"image\/png"}],"author":"Hannah Laurel","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Hannah Laurel","Est. reading time":"8 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/"},"author":{"name":"Hannah Laurel","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/d70b9304da33992d8663bf2933fa52cb"},"headline":"What Is DiskANN? Billion-Scale Vector Search Explained","datePublished":"2026-06-08T17:52:06+00:00","dateModified":"2026-06-08T17:52:07+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/"},"wordCount":1748,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png","articleSection":["Vector Search"],"inLanguage":"pt-BR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/diskann\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/diskann\/","url":"https:\/\/www.couchbase.com\/blog\/diskann\/","name":"What Is DiskANN? Billion-Scale Vector Search Explained - The Couchbase Blog","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png","datePublished":"2026-06-08T17:52:06+00:00","dateModified":"2026-06-08T17:52:07+00:00","description":"DiskANN is a graph-based vector search algorithm that indexes billions of vectors on SSD with high recall and millisecond latency. Learn how it works, how it compares to HNSW, and which databases use it.","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/diskann\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/What-Is-DiskANN_-Billion-Scale-Vector-Search-Explained.png","width":2400,"height":1256},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/diskann\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"What Is DiskANN? Billion-Scale Vector Search Explained"}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"The Couchbase Blog","description":"Couchbase, the NoSQL Database","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"The Couchbase Blog","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/logo.svg","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/5\/2026\/06\/logo.svg","width":"1024","height":"1024","caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/d70b9304da33992d8663bf2933fa52cb","name":"Hannah Laurel","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/secure.gravatar.com\/avatar\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g83799598d1fc957e38a4e9f3226e010d","url":"https:\/\/secure.gravatar.com\/avatar\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1dd35f9b7985360f147d42a040c78c7960583704fa9a68a2bfef9c4de16e2cbd?s=96&d=mm&r=g","caption":"Hannah Laurel"},"url":"https:\/\/www.couchbase.com\/blog\/pt\/author\/hannah-laurel\/"}]}},"acf":[],"authors":[{"term_id":1022,"user_id":81637,"is_guest":0,"slug":"hannah-laurel","display_name":"Hannah Laurel","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/5462","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/users\/81637"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/comments?post=5462"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/5462\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media\/5463"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media?parent=5462"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/categories?post=5462"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/tags?post=5462"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/ppma_author?post=5462"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}