Vishal Mehta – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins
Feed: http://www.open-lab.net/blog/feed/ (updated 2025-03-18T18:27:34Z)

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models
Vishal Mehta | http://www.open-lab.net/blog/?p=83013 | Updated 2025-03-18T18:27:34Z | Published 2024-06-12T16:00:00Z

As of March 18, 2025, NVIDIA Triton Inference Server is part of the NVIDIA Dynamo Platform and has been renamed NVIDIA Dynamo Triton.

AI is transforming every industry, addressing grand human scientific challenges such as precision drug discovery and the development of autonomous vehicles, as well as solving commercial problems such as automating the creation of e-commerce…

NVIDIA Grace CPU Superchip Architecture In Depth
Vishal Mehta | http://www.open-lab.net/blog/?p=59829 | Updated 2023-12-06T23:46:07Z | Published 2023-01-20T19:30:00Z

The NVIDIA Grace CPU is the first data center CPU developed by NVIDIA. Combining NVIDIA expertise with Arm processors, on-chip fabrics, system-on-chip (SoC) design, and resilient high-bandwidth low-power memory technologies, the Grace CPU was built from the ground up to create the world’s first superchip for computing. At the heart of the superchip lies the NVLink Chip-2-Chip (C2C).

NVIDIA Grace Hopper Superchip Architecture In-Depth
Vishal Mehta | http://www.open-lab.net/blog/?p=57192 | Updated 2022-11-18T11:48:05Z | Published 2022-11-10T19:00:00Z

The NVIDIA Grace Hopper Superchip Architecture is the first true heterogeneous accelerated platform for high-performance computing (HPC) and AI workloads. It accelerates applications with the strengths of both GPUs and CPUs while providing the simplest and most productive distributed heterogeneous programming model to date. Scientists and engineers can focus on solving the world’s most important…

NVIDIA Hopper Architecture In-Depth
Vishal Mehta | http://www.open-lab.net/blog/?p=45555 | Updated 2023-10-25T23:51:26Z | Published 2022-03-22T18:00:00Z

Today during the 2022 NVIDIA GTC Keynote address, NVIDIA CEO Jensen Huang introduced the new NVIDIA H100 Tensor Core GPU based on the new NVIDIA Hopper GPU architecture. This post gives you a look inside the new H100 GPU and describes important new features of NVIDIA Hopper architecture GPUs. The NVIDIA H100 Tensor Core GPU is our ninth-generation data center GPU designed to deliver an…

Accelerating Random Forests Up to 45x Using cuML
Vishal Mehta | http://www.open-lab.net/blog/?p=23368 | Updated 2022-08-21T23:40:56Z | Published 2021-02-25T18:27:11Z

This post was originally published on the RAPIDS AI blog. Random forests are a popular machine learning technique for classification and regression problems. By building multiple independent decision trees, they reduce the problems of overfitting seen with individual trees. In this post, I review the basic random forest algorithms, show how their training can be parallelized on NVIDIA…
