AI Platforms / Deployment

May 18, 2025

Announcing NVIDIA Exemplar Clouds for Benchmarking AI Cloud Infrastructure

Developers and enterprises training large language models (LLMs) and deploying AI workloads in the cloud have long faced a fundamental challenge: it’s nearly...

4 MIN READ

May 18, 2025

Designing AI Factories Using OpenUSD and SimReady Assets

Announced at COMPUTEX 2025, the NVIDIA Omniverse Blueprint for AI factory digital twins has expanded to support OpenUSD schemas. The blueprint features new...

4 MIN READ

May 18, 2025

Integrating Semi-Custom Compute into Rack-Scale Architecture with NVIDIA NVLink Fusion

Data centers are being re-architected for efficient delivery of AI workloads. This is a hugely complicated endeavor, and NVIDIA is now delivering AI factories...

7 MIN READ

May 18, 2025

NVIDIA ConnectX-8 SuperNICs Advance AI Platform Architecture with PCIe Gen6 Connectivity

As AI workloads grow in complexity and scale—from large language models (LLMs) to agentic AI reasoning and physical AI—the demand for faster, more scalable...

5 MIN READ

A drawing of a person holding a phone, with a callout of the phone screen and chat bubbles.

May 15, 2025

Accelerating Embedding Lookups with cuEmbed

NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...

8 MIN READ

An illustration representing generative AI.

Apr 29, 2025

Choosing Your First Local AI Project?

AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced...

7 MIN READ

Apr 24, 2025

Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...

7 MIN READ

Apr 22, 2025

NVIDIA GTC Training Labs Now Available On Demand

Missed GTC? This year’s training labs are now available on demand to watch anywhere, anytime.

1 MIN READ

Apr 21, 2025

Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT

State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant...

8 MIN READ

Apr 18, 2025

Upcoming Event: NVIDIA Agent Toolkit Hackathon

Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit — contest runs May 12 to May 23.

1 MIN READ

Apr 17, 2025

Neural Rendering in NVIDIA OptiX Using Cooperative Vectors

The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature...

13 MIN READ

Apr 16, 2025

Developing an AI-Powered Tool for Automatic Citation Validation Using NVIDIA NIM

The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can...

9 MIN READ

Apr 15, 2025

NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy

AI is no longer just about generating text or images—it’s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...

8 MIN READ

Apr 09, 2025

Delivering NVIDIA Accelerated Computing for Enterprise AI Workloads with Rafay

The worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment...

8 MIN READ

Apr 01, 2025

NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration

Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....

10 MIN READ

Mar 31, 2025

Practical Tips for Preventing GPU Fragmentation for Volcano Scheduler

At NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA...

7 MIN READ