Data Center / Cloud

May 22, 2025
Blackwell Breaks the 1,000 TPS/User Barrier With Meta’s Llama 4 Maverick
NVIDIA has achieved a world-record large language model (LLM) inference speed. A single NVIDIA DGX B200 node with eight NVIDIA Blackwell GPUs can achieve over...
9 MIN READ

May 21, 2025
NVIDIA Dynamo Accelerates llm-d Community Initiatives for Advancing Large-Scale Distributed Inference
The introduction of the llm-d community at Red Hat Summit 2025 marks a significant step forward in accelerating generative AI inference innovation for the open...
5 MIN READ

May 20, 2025
Just Announced: Join the Google Cloud & NVIDIA Developer Community
Master AI with Google Cloud & NVIDIA. Access an exclusive community, resources, and rewards.
1 MIN READ

May 20, 2025
NVIDIA Dynamo Adds GPU Autoscaling, Kubernetes Automation, and Networking Optimizations
At NVIDIA GTC 2025, we announced NVIDIA Dynamo, a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning...
7 MIN READ

May 20, 2025
NVIDIA 800 V HVDC Architecture Will Power the Next Generation of AI Factories
The exponential growth of AI workloads is increasing data center power demands. Traditional 54 V in-rack power distribution, designed for kilowatt (KW)-scale...
8 MIN READ

May 19, 2025
Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine
In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...
9 MIN READ

May 18, 2025
Announcing NVIDIA Exemplar Clouds for Benchmarking AI Cloud Infrastructure
Developers and enterprises training large language models (LLMs) and deploying AI workloads in the cloud have long faced a fundamental challenge: it’s nearly...
4 MIN READ

May 18, 2025
Designing AI Factories Using OpenUSD and SimReady Assets
Announced at COMPUTEX 2025, the NVIDIA Omniverse Blueprint for AI factory digital twins has expanded to support OpenUSD schemas. The blueprint features new...
4 MIN READ

May 18, 2025
Integrating Semi-Custom Compute into Rack-Scale Architecture with NVIDIA NVLink Fusion
Data centers are being re-architected for efficient delivery of AI workloads. This is a hugely complicated endeavor, and NVIDIA is now delivering AI factories...
7 MIN READ

May 18, 2025
NVIDIA ConnectX-8 SuperNICs Advance AI Platform Architecture with PCIe Gen6 Connectivity
As AI workloads grow in complexity and scale—from large language models (LLMs) to agentic AI reasoning and physical AI—the demand for faster, more scalable...
5 MIN READ

May 18, 2025
Deploy AI-RAN at Cell Sites with NVIDIA ARC-Compact?
Wireless networks are the backbone of modern connectivity, serving billions of 5G users through millions of cell sites globally. The opportunities and benefits...
11 MIN READ

May 16, 2025
Building the Modular Foundation for AI Factories with NVIDIA MGX
The exponential growth of generative AI, large language models (LLMs), and high-performance computing has created unprecedented demands on data center...
6 MIN READ

May 15, 2025
Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled
Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...
10 MIN READ

May 14, 2025
Build Custom Reasoning Models with Advanced, Open Post-Training Datasets
Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...
5 MIN READ

May 14, 2025
AI Fabric Resiliency and Why Network Convergence Matters
High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication...
7 MIN READ

May 14, 2025
NVIDIA TensorRT Unlocks FP4 Image Generation ?for NVIDIA Blackwell GeForce RTX 50 Series GPUs
The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX...
11 MIN READ