Vector Search – NVIDIA Technical Blog

Vector Search – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-07-11T15:00:00Z http://www.open-lab.net/blog/feed/ Nicolas Dupont <![CDATA[Bringing Confidentiality to Vector Search with Cyborg and NVIDIA cuVS]]> http://www.open-lab.net/blog/?p=87131 2024-10-03T21:17:06Z 2024-08-15T16:00:00Z

In the era of generative AI, vector databases have become indispensable for storing and querying high-dimensional data efficiently. However, like all databases,...]]>

In the era of generative AI, vector databases have become indispensable for storing and querying high-dimensional data efficiently. However, like all databases,...

Bringing Confidentiality to Vector Search with Cyborg and RAPIDS cuVS

In the era of generative AI, vector databases have become indispensable for storing and querying high-dimensional data efficiently. However, like all databases, vector databases are vulnerable to a range of attacks, including cyber threats, phishing attempts, and unauthorized access. This vulnerability is particularly concerning considering that these databases often contain sensitive and��

]]> 0 Artem Chirkin <![CDATA[Accelerating Vector Search: NVIDIA cuVS IVF-PQ Part 2, Performance Tuning]]> http://www.open-lab.net/blog/?p=81681 2024-10-03T21:18:45Z 2024-07-18T17:10:03Z

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...]]>

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the...

cuvs ivf-pq part 2

In the first part of the series, we presented an overview of the IVF-PQ algorithm and explained how it builds on top of the IVF-Flat algorithm, using the Product Quantization (PQ) technique to compress the index and support larger datasets. In this part two of the IVF-PQ post, we cover the practical aspects of tuning IVF-PQ performance. It��s worth noting again that IVF-PQ uses a lossy��

]]> 0 Artem Chirkin <![CDATA[Accelerating Vector Search: NVIDIA cuVS IVF-PQ Part 1, Deep Dive]]> http://www.open-lab.net/blog/?p=81652 2024-10-03T21:19:09Z 2024-07-18T17:09:45Z

In this post, we continue the series on accelerating vector search using NVIDIA cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...]]>

In this post, we continue the series on accelerating vector search using NVIDIA cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for...

cuvs ivf-pq part 1

In this post, we continue the series on accelerating vector search using NVIDIA cuVS. Our previous post in the series introduced IVF-Flat, a fast algorithm for accelerating approximate nearest neighbors (ANN) search on GPUs. We discussed how using an inverted file index (IVF) provides an intuitive way to reduce the complexity of the nearest neighbor search by limiting it to only a small subset of��

]]> 0 Tam��s Feh��r <![CDATA[Accelerated Vector Search: Approximating with NVIDIA cuVS Inverted Index]]> http://www.open-lab.net/blog/?p=70772 2024-11-07T05:00:52Z 2023-10-02T18:16:58Z

Performing an exhaustive exact k-nearest neighbor (kNN) search, also known as brute-force search, is expensive, and it doesn��t scale particularly well to...]]>

Performing an exhaustive exact k-nearest neighbor (kNN) search, also known as brute-force search, is expensive, and it doesn��t scale particularly well to...

ivf_blog_search2

Performing an exhaustive exact k-nearest neighbor (kNN) search, also known as brute-force search, is expensive, and it doesn��t scale particularly well to larger datasets. During vector search, brute-force search requires the distance to be calculated between every query vector and database vector. For the frequently used Euclidean and cosine distances, the computation task becomes equivalent to a��

]]> 0 Mickael Ide <![CDATA[Accelerating Vector Search: Fine-Tuning GPU Index Algorithms]]> http://www.open-lab.net/blog/?p=69885 2024-11-18T21:16:14Z 2023-09-11T16:00:00Z

In this post, we dive deeper into each of the GPU-accelerated indexes mentioned in part 1 and give a brief explanation of how the algorithms work, along with a...]]>

In this post, we dive deeper into each of the GPU-accelerated indexes mentioned in part 1 and give a brief explanation of how the algorithms work, along with a...

vector-search-part-2-featured

In this post, we dive deeper into each of the GPU-accelerated indexes mentioned in part 1 and give a brief explanation of how the algorithms work, along with a summary of important parameters to fine-tune their behavior. We then go through a simple end-to-end example to demonstrate cuVS�� Python APIs on a question-and-answer problem with a pretrained large language model and provide a��

]]> 0 Mickael Ide <![CDATA[Accelerating Vector Search: Using GPU-Powered Indexes with NVIDIA cuVS]]> http://www.open-lab.net/blog/?p=69884 2024-11-07T05:04:43Z 2023-09-11T15:59:00Z

In the current AI landscape, vector search is one of the hottest topics due to its applications in large language models (LLM) and generative AI. Semantic...]]>

In the current AI landscape, vector search is one of the hottest topics due to its applications in large language models (LLM) and generative AI. Semantic...

vector-search-part-1-featured

In the current AI landscape, vector search is one of the hottest topics due to its applications in large language models (LLM) and generative AI. Semantic vector search enables a broad range of important tasks like detecting fraudulent transactions, recommending products to users, using contextual information to augment full-text searches, and finding actors that pose potential security risks.

]]> 0 Corey Nolet <![CDATA[Reusable Computational Patterns for Machine Learning and Information Retrieval with RAPIDS RAFT]]> http://www.open-lab.net/blog/?p=62315 2023-10-13T05:52:25Z 2023-03-22T15:00:00Z

RAPIDS is a suite of accelerated libraries for data science and machine learning on GPUs: cuDF for pandas-like data structures cuGraph for graph data cuML for...]]>

RAPIDS is a suite of accelerated libraries for data science and machine learning on GPUs: cuDF for pandas-like data structures cuGraph for graph data cuML for... Computer art of squares on a dark background, with light coming from one side.

Computer art of squares on a dark background, with light coming from one side.

RAPIDS is a suite of accelerated libraries for data science and machine learning on GPUs: In many data analytics and machine learning algorithms, computational bottlenecks tend to come from a small subset of steps that dominate the end-to-end performance. Reusable solutions for these steps often require low-level primitives that are non-trivial and time-consuming to write well.

]]> 0 Michelle Horton <![CDATA[Accelerating Billion Vector Similarity Searches with GPUs]]> http://www.open-lab.net/blog/?p=35502 2023-10-06T08:15:47Z 2021-07-29T15:47:48Z

Relying on the capabilities of GPUs, a team from Facebook AI Research has developed a faster, more efficient way for AI to run similarity searches. The study,...]]>

Relying on the capabilities of GPUs, a team from Facebook AI Research has developed a faster, more efficient way for AI to run similarity searches. The study,... A collection of images.

A collection of images.

Relying on the capabilities of GPUs, a team from Facebook AI Research has developed a faster, more efficient way for AI to run similarity searches. The study, published in IEEE Transactions on Big Data, creates a deep learning algorithm capable of handling and comparing high-dimensional data from media that is notably faster, while just as accurate as previous techniques. In a world with an��

]]> 0 ��˳��97caoporen��