Benedikt Schifferer – NVIDIA Technical Blog

Benedikt Schifferer – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-06-30T16:56:42Z http://www.open-lab.net/blog/feed/ Benedikt Schifferer <![CDATA[Best-in-Class Multimodal RAG: How the Llama 3.2 NeMo Retriever Embedding Model Boosts Pipeline Accuracy]]> http://www.open-lab.net/blog/?p=102704 2025-06-30T16:56:42Z 2025-06-30T16:56:34Z

Data goes far beyond text��it is inherently multimodal, encompassing images, video, audio, and more, often in complex and unstructured formats. While the...]]>

Data goes far beyond text—it is inherently multimodal, encompassing images, video, audio, and more, often in complex and unstructured formats. While the common method is to convert PDFs, scanned images, slides, and other documents into text, it is challenging to capture all information in text format, as shown in Figure 1. The loss of visual information in text motivated the development of…

]]> Benedikt Schifferer <![CDATA[Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage]]> http://www.open-lab.net/blog/?p=93638 2024-12-17T20:42:28Z 2024-12-17T16:00:00Z

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...]]>

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity, summarization, and item recommendation. It also plays a pivotal role in retrieval-augmented generation (RAG), a technique that enables large language models (LLMs) to access external context without modifying underlying parameters.

]]> Benedikt Schifferer <![CDATA[Evaluating Retriever for Enterprise-Grade RAG]]> http://www.open-lab.net/blog/?p=78222 2024-10-28T21:59:05Z 2024-02-23T19:02:26Z

The conversation about designing and evaluating Retrieval-Augmented Generation (RAG) systems is a long, multi-faceted discussion. Even when we look at retrieval...]]>

The conversation about designing and evaluating Retrieval-Augmented Generation (RAG) systems is a long, multi-faceted discussion. Even when we look at retrieval on its own, developers selectively employ many techniques, such as query decomposition, re-writing, building soft filters, and more, to increase the accuracy of their RAG pipelines. While the techniques vary from system to system…

]]> 0 Benedikt Schifferer <![CDATA[Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model]]> http://www.open-lab.net/blog/?p=74346 2024-10-28T22:00:06Z 2023-11-28T18:10:50Z

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation...]]>

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation enterprise productivity applications, they enhance user efficiency across tasks like programming, copy editing, brainstorming, and answering questions on a wide range of topics. However, these models often struggle with real-time events and…

]]> 0 Benedikt Schifferer <![CDATA[Using Neural Networks for Your Recommender System]]> http://www.open-lab.net/blog/?p=34713 2024-10-28T19:23:25Z 2021-07-20T13:00:00Z

Deep learning (DL) is the state-of-the-art solution for many machine learning problems, such as computer vision or natural language problems and it outperforms...]]>

Deep learning (DL) is the state-of-the-art solution for many machine learning problems, such as computer vision or natural language problems and it outperforms alternative methods. Recent trends include applying DL techniques to recommendation engines. Many large companies—such as AirBnB, Facebook, Google, Home Depot, LinkedIn, and Pinterest—share their experience in using DL for recommender…

]]> 2 Benedikt Schifferer <![CDATA[How to Build a Winning Deep Learning Powered Recommender System-Part 3]]> http://www.open-lab.net/blog/?p=31268 2024-10-28T19:15:46Z 2021-05-06T18:00:00Z

Recommender systems (RecSys) have become a key component in many online services, such as e-commerce, social media, news service, or online video streaming....]]>

Recommender systems (RecSys) have become a key component in many online services, such as e-commerce, social media, news service, or online video streaming. However with the growth in importance, the growth in scale of industry datasets, and more sophisticated models, the bar has been raised for computational resources required for recommendation systems. To meet the computational demands…

]]> 0 Benedikt Schifferer <![CDATA[Announcing the NVIDIA NVTabular Open Beta with Multi-GPU Support and New Data Loaders]]> http://www.open-lab.net/blog/?p=21200 2024-10-28T18:24:20Z 2020-10-05T13:00:00Z

Recently, NVIDIA CEO Jensen Huang announced updates to the open beta of NVIDIA Merlin, an end-to-end framework that democratizes the development of large-scale...]]>

Recently, NVIDIA CEO Jensen Huang announced updates to the open beta of NVIDIA Merlin, an end-to-end framework that democratizes the development of large-scale deep learning recommenders. With NVIDIA Merlin, data scientists, machine learning engineers, and researchers can accelerate their entire workflow pipeline from ingesting and training to deploying GPU-accelerated recommenders (Figure 1).

]]> 0 ��˳��97caoporen��