Models / Libraries / Frameworks

May 19, 2025

Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine

In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...

9 MIN READ

May 19, 2025

NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11

AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...

9 MIN READ

May 18, 2025

Spotlight: Perfect Corp. Delivers Personalized Digital Beauty Experiences Using NVIDIA TensorRT and NVENC

Augmented reality (AR) and AI are revolutionizing the beauty and fashion industry by offering hyperpersonalized experiences, from virtual try-ons to AI-driven...

4 MIN READ

May 16, 2025

Build Agents and Understand Long Docs with Mistral Medium 3 and NVIDIA NIM

Developers building powerful multimodal applications now have a new state-of-the-art model designed for enterprise-scale performance with Mistral Medium 3....

2 MIN READ

May 15, 2025

Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled

Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...

10 MIN READ

A drawing of a person holding a phone, with a callout of the phone screen and chat bubbles.

May 15, 2025

Accelerating Embedding Lookups with cuEmbed

NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...

8 MIN READ

May 12, 2025

Just Released: NVIDIA Warp is Now Open-Source Under Apache 2.0

NVIDIA Warp, a simulation computing framework, is now accessible to all developers.

1 MIN READ

May 12, 2025

Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry

The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...

8 MIN READ

An illustration showing molecules and a brain.

May 09, 2025

Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research

Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...

11 MIN READ

May 08, 2025

Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA Nemo Framework

Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...

6 MIN READ

May 06, 2025

New NVIDIA NV-Tesseract Time Series Models Advance Dataset Processing and Anomaly Detection

Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it’s streamlining...

5 MIN READ

May 05, 2025

Just Released: CUDA 12.9

New features include enhancements to confidential computing and family-specific features and targets supported by NVCC.

1 MIN READ

May 02, 2025

Integrate and Deploy Tongyi Qwen3 Models into Production Applications with NVIDIA

Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models,...

7 MIN READ

May 02, 2025

An Even Easier Introduction to CUDA (Updated)

Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...

16 MIN READ

An image representing matrix multiplication.

May 01, 2025

Boosting Matrix Multiplication Speed and Flexibility with NVIDIA cuBLAS 12.9

The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more. Two...

8 MIN READ

May 01, 2025

Stacking Generalization with HPO: Maximize Accuracy in 15 Minutes with NVIDIA cuML

Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...

7 MIN READ