Rohil Bhargava – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins
2025-03-20T22:34:30Z
http://www.open-lab.net/blog/feed/

Rohil Bhargava
NVIDIA Blackwell Ultra for the Era of AI Reasoning
http://www.open-lab.net/blog/?p=96761
Updated 2025-03-20T22:34:30Z; published 2025-03-19T18:00:15Z

For years, advancements in AI have followed a clear trajectory through pretraining scaling: larger models, more data, and greater computational resources lead to breakthrough capabilities. Over the last five years, pretraining scaling has increased compute requirements by a factor of 50 million. However, building more intelligent systems is no longer just about pretraining bigger models.

Rohil Bhargava
Accelerating Oracle Database Generative AI Workloads with NVIDIA NIM and NVIDIA cuVS
http://www.open-lab.net/blog/?p=88963
Updated 2024-10-28T21:54:43Z; published 2024-09-17T19:04:16Z

The vast majority of the world’s data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI applications that will make a transformative business impact. Retrieval-augmented generation (RAG) pipelines are a key part of this, enabling users to have conversations with large corpuses of data and turning manuals, policy documents…

Rohil Bhargava
Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform
http://www.open-lab.net/blog/?p=78388
Updated 2025-03-18T18:31:44Z; published 2024-03-07T19:05:46Z

As of 3/18/25, NVIDIA Triton Inference Server is now NVIDIA Dynamo. Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by iteratively shaping random noise into AI-generated art through denoising diffusion techniques. This can be applied to many enterprise use cases such as creating personalized…

Rohil Bhargava
Deploying Retrieval-Augmented Generation Applications on NVIDIA GH200 Delivers Accelerated Performance
http://www.open-lab.net/blog/?p=74632
Updated 2024-09-22T15:11:34Z; published 2023-12-18T17:00:00Z

Large language model (LLM) applications are essential in enhancing productivity across industries through natural language. However, their effectiveness is often limited by the extent of their training data, resulting in poor performance when dealing with real-time events and new knowledge the LLM isn’t trained on. Retrieval-augmented generation (RAG) solves these problems.

Rohil Bhargava
Boosting AI Model Inference Performance on Azure Machine Learning
http://www.open-lab.net/blog/?p=54061
Updated 2022-11-14T21:35:42Z; published 2022-08-29T17:00:00Z

Every AI application needs a strong inference engine. Whether you’re deploying an image recognition service, intelligent virtual assistant, or a fraud detection application, a reliable inference server delivers fast, accurate…

Rohil Bhargava
Building a Speech-Enabled AI Virtual Assistant with NVIDIA Riva on Amazon EC2
http://www.open-lab.net/blog/?p=50606
Updated 2023-03-14T18:54:05Z; published 2022-07-28T15:30:00Z

Speech AI can assist human agents in contact centers, power virtual assistants and digital avatars, generate live captioning in video conferencing, and much more. Under the hood, these voice-based technologies orchestrate a network of automatic speech recognition (ASR) and text-to-speech (TTS) pipelines to deliver intelligent, real-time responses.
