Models / Libraries / Frameworks

Jun 04, 2025

NVIDIA Speech AI Models Deliver Industry-Leading Accuracy and Performance

NVIDIA is driving state-of-the-art performance, efficiency, and accessibility in both speech AI and language models, setting the stage for innovations that are...

5 MIN READ

Jun 02, 2025

Scaling to Millions of Tokens with Efficient Long-Context LLM Training

The evolution of large language models (LLMs) has been marked by significant advancements in their ability to process and generate text. Among these...

7 MIN READ

May 27, 2025

Advanced Optimization Strategies for LLM Training on NVIDIA Grace Hopper

In the previous post, Profiling LLM Training Workflows on NVIDIA Grace Hopper, we explored the importance of profiling large language model (LLM) training...

10 MIN READ

May 27, 2025

Profiling LLM Training Workflows on NVIDIA Grace Hopper

The rapid advancements in AI have resulted in an era of exponential growth in model sizes, particularly in the domain of large language models (LLMs). These...

12 MIN READ

May 23, 2025

Unlock Efficient Data Processing with the Latest from NVIDIA DALI

NVIDIA DALI, a portable, open source software library for decoding and augmenting images, videos, and speech, recently introduced several features that improve...

8 MIN READ

May 22, 2025

Spotlight: Infleqtion Optimizes Portfolios Using Q-CHOP and NVIDIA CUDA-Q Dynamics

Computing is an essential tool for the modern financial services industry. Profits are won and lost based on the speed and accuracy of algorithms guiding...

9 MIN READ

May 22, 2025

Grandmaster Pro Tip: Winning First Place in a Kaggle Competition with Stacking Using cuML

What does it take to win a Kaggle competition in 2025? In the April Playground challenge, the goal was to predict how long users would listen to a podcast—and...

7 MIN READ

May 21, 2025

Just Released: NVIDIA HPC SDK v25.5

The new release includes support for CUDA 12.9, updated library components, and performance improvements.

1 MIN READ

May 19, 2025

Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine

In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...

9 MIN READ

May 19, 2025

NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11

AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...

9 MIN READ

May 18, 2025

Spotlight: Perfect Corp. Delivers Personalized Digital Beauty Experiences Using NVIDIA TensorRT and NVENC

Augmented reality (AR) and AI are revolutionizing the beauty and fashion industry by offering hyperpersonalized experiences, from virtual try-ons to AI-driven...

4 MIN READ

May 16, 2025

Build Agents and Understand Long Docs with Mistral Medium 3 and NVIDIA NIM

Developers building powerful multimodal applications now have a new state-of-the-art model designed for enterprise-scale performance with Mistral Medium 3....

2 MIN READ

May 15, 2025

Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled

Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...

10 MIN READ

A drawing of a person holding a phone, with a callout of the phone screen and chat bubbles.

May 15, 2025

Accelerating Embedding Lookups with cuEmbed

NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...

8 MIN READ

May 12, 2025

Just Released: NVIDIA Warp is Now Open-Source Under Apache 2.0

NVIDIA Warp, a simulation computing framework, is now accessible to all developers.

1 MIN READ

May 12, 2025

Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry

The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...

8 MIN READ