Generative AI

May 22, 2025
Blackwell Breaks the 1,000 TPS/User Barrier With Meta’s Llama 4 Maverick
NVIDIA has achieved a world-record large language model (LLM) inference speed. A single NVIDIA DGX B200 node with eight NVIDIA Blackwell GPUs can achieve over...
9 MIN READ

May 21, 2025
NVIDIA Dynamo Accelerates llm-d Community Initiatives for Advancing Large-Scale Distributed Inference
The introduction of the llm-d community at Red Hat Summit 2025 marks a significant step forward in accelerating generative AI inference innovation for the open...
5 MIN READ

May 20, 2025
Just Announced: Join the Google Cloud & NVIDIA Developer Community
Master AI with Google Cloud & NVIDIA. Access an exclusive community, resources, and rewards.
1 MIN READ

May 20, 2025
NVIDIA Dynamo Adds GPU Autoscaling, Kubernetes Automation, and Networking Optimizations
At NVIDIA GTC 2025, we announced NVIDIA Dynamo, a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning...
7 MIN READ

May 19, 2025
NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11
AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...
9 MIN READ

May 18, 2025
Spotlight: Perfect Corp. Delivers Personalized Digital Beauty Experiences Using NVIDIA TensorRT and NVENC
Augmented reality (AR) and AI are revolutionizing the beauty and fashion industry by offering hyperpersonalized experiences, from virtual try-ons to AI-driven...
4 MIN READ

May 18, 2025
Announcing NVIDIA Exemplar Clouds for Benchmarking AI Cloud Infrastructure
Developers and enterprises training large language models (LLMs) and deploying AI workloads in the cloud have long faced a fundamental challenge: it’s nearly...
4 MIN READ

May 16, 2025
Build Agents and Understand Long Docs with Mistral Medium 3 and NVIDIA NIM
Developers building powerful multimodal applications now have a new state-of-the-art model designed for enterprise-scale performance with Mistral Medium 3....
2 MIN READ

May 15, 2025
AI Helps Uncover Potential Alzheimer’s Cause and Treatment
A gene that can be an early indicator for Alzheimer’s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of...
3 MIN READ

May 14, 2025
Build Custom Reasoning Models with Advanced, Open Post-Training Datasets
Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ

May 14, 2025
Get Trained and Certified at GTC Paris at VivaTech 2025
Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.
1 MIN READ

May 12, 2025
Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry
The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...
8 MIN READ

May 12, 2025
Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework
As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ

May 09, 2025
Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research
Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...
11 MIN READ

May 08, 2025
Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks
NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...
12 MIN READ

May 08, 2025
Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT
Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....
5 MIN READ