featured

Jul 23, 2025

Approaches to PDF Data Extraction for Information Retrieval

The PDF is among the most common file formats for sharing information such as financial reports, research papers, technical documents, and marketing materials....

11 MIN READ

Jul 23, 2025

Serverless Distributed Data Processing with Apache Spark and NVIDIA AI on Azure

The process of converting vast libraries of text into numerical representations known as embeddings is essential for generative AI. Various technologies—from...

9 MIN READ

Jul 22, 2025

Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo

Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful...

16 MIN READ

Jul 22, 2025

Understanding NCCL Tuning to Accelerate GPU-to-GPU Communication

The NVIDIA Collective Communications Library (NCCL) is essential for fast GPU-to-GPU communication in AI workloads, using various optimizations and tuning to...

14 MIN READ

Jul 22, 2025

Kimi-K2-Instruct Now Available as NVIDIA NIM

Try the new 1T-parameter open source MoE LLM today.

1 MIN READ

Jul 22, 2025

Building Robotic Mental Models with NVIDIA Warp and Gaussian Splatting

This post explores a promising direction for building dynamic digital representations of the physical world, a topic gaining increasing attention in recent...

4 MIN READ

Jul 21, 2025

Traditional RAG vs. Agentic RAG—Why AI Agents Need Dynamic Knowledge to Get Smarter

Ever relied on an old GPS that didn’t know about the new highway bypass, or a sudden road closure? It might get you to your destination, but not in the most...

8 MIN READ

Black and white topology of connected nodes in NVIDIA Air.

Jul 18, 2025

Automating Network Design in NVIDIA Air with Ansible and Git

At its core, NVIDIA Air is built for automation. Every part of your network can be coded, versioned, and set to trigger automatically. This includes creating...

6 MIN READ

Jul 18, 2025

Optimizing for Low-Latency Communication in Inference Workloads with JAX and XLA

Running inference with large language models (LLMs) in production requires meeting stringent latency constraints. A critical stage in the process is LLM decode,...

6 MIN READ

Jul 18, 2025

3 pandas Workflows That Slowed to a Crawl on Large Datasets—Until We Turned on GPUs

If you work with pandas, you’ve probably hit the wall. It’s that moment when your trusty workflow, so elegant on smaller datasets, grinds to a halt on a...

4 MIN READ

Jul 17, 2025

Hackathon Winners Bring Agentic AI to Life with the NVIDIA NeMo Agent Toolkit

The best way to learn a new toolkit is to build something real, and that’s exactly what developers did at the recent NVIDIA NeMo Agent Toolkit Hackathon. Over...

6 MIN READ

Jul 17, 2025

NVIDIA?Canary?Qwen?2.5B: Open?Source ASR/LLM for Superior Transcription and Summarization

Top?ranked on the HuggingFace Open?ASR leaderboard, the model is production?ready.

1 MIN READ

Jul 17, 2025

Feature Engineering at Scale: Optimizing?ML Models in Semiconductor Manufacturing with NVIDIA?CUDA?X?Data Science

In our previous post, we introduced the setup of predictive modeling in chip manufacturing and operations, highlighting common challenges such as imbalanced...

6 MIN READ

Jul 17, 2025

New Learning Pathway: Deploy AI Models with NVIDIA NIM on GKE

Get hands-on with Google Kubernetes Engine (GKE) and NVIDIA NIM when you join the new Google Cloud and NVIDIA community.

1 MIN READ

Jul 17, 2025

Safeguard Agentic AI Systems with the NVIDIA Safety Recipe

As large language models (LLMs) power more agentic systems capable of performing autonomous actions, tool use, and reasoning, enterprises are drawn to their...

7 MIN READ

Jul 16, 2025

Driving AI-Powered Robotics Development with NVIDIA Isaac for Healthcare

By 2030, the World Health Organization projects a global shortage of over 15 million healthcare workers, including surgeons, radiologists, and nurses. In the...

6 MIN READ