Generative AI

Jul 23, 2025

Serverless Distributed Data Processing with Apache Spark and NVIDIA AI on Azure

The process of converting vast libraries of text into numerical representations known as embeddings is essential for generative AI. Various technologies—from...

9 MIN READ

Jul 22, 2025

Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo

Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful...

16 MIN READ

Jul 22, 2025

Kimi-K2-Instruct Now Available as NVIDIA NIM

Try the new 1T-parameter open source MoE LLM today.

1 MIN READ

Jul 21, 2025

Traditional RAG vs. Agentic RAG—Why AI Agents Need Dynamic Knowledge to Get Smarter

Ever relied on an old GPS that didn’t know about the new highway bypass, or a sudden road closure? It might get you to your destination, but not in the most...

8 MIN READ

Jul 17, 2025

Hackathon Winners Bring Agentic AI to Life with the NVIDIA NeMo Agent Toolkit

The best way to learn a new toolkit is to build something real, and that’s exactly what developers did at the recent NVIDIA NeMo Agent Toolkit Hackathon. Over...

6 MIN READ

Jul 17, 2025

NVIDIA?Canary?Qwen?2.5B: Open?Source ASR/LLM for Superior Transcription and Summarization

Top?ranked on the HuggingFace Open?ASR leaderboard, the model is production?ready.

1 MIN READ

Jul 17, 2025

New Learning Pathway: Deploy AI Models with NVIDIA NIM on GKE

Get hands-on with Google Kubernetes Engine (GKE) and NVIDIA NIM when you join the new Google Cloud and NVIDIA community.

1 MIN READ

Jul 17, 2025

Safeguard Agentic AI Systems with the NVIDIA Safety Recipe

As large language models (LLMs) power more agentic systems capable of performing autonomous actions, tool use, and reasoning, enterprises are drawn to their...

7 MIN READ

Jul 16, 2025

CUTLASS: Principled Abstractions for Handling Multidimensional Data Through Tensors and Spatial Microkernels

In the era of generative AI, utilizing GPUs to their maximum potential is essential to training better models and serving users at scale. Often, these models...

12 MIN READ

Jul 15, 2025

Accelerate AI Model Orchestration with NVIDIA Run:ai on AWS

When it comes to developing and deploying advanced AI models, access to scalable, efficient GPU infrastructure is critical. But managing this infrastructure...

5 MIN READ

Jul 14, 2025

Upcoming Livestream: Techniques for Building High-Performance RAG Applications

Discover leaderboard-winning RAG techniques, integration strategies, and deployment best practices.

1 MIN READ

Jul 14, 2025

Just Released: NVDIA Run:ai 2.22

NVDIA Run:ai 2.22 is now here. It brings advanced inference capabilities, smarter workload management, and more controls.

1 MIN READ

Jul 11, 2025

Improving Synthetic Data Augmentation and Human Action Recognition with SynthDa

Human action recognition is a capability in AI systems designed for safety-critical applications, such as surveillance, eldercare, and industrial monitoring....

10 MIN READ

Jul 10, 2025

From Terabytes to Turnkey: AI-Powered Climate Models Go Mainstream

In the race to understand our planet’s changing climate, speed and accuracy are everything. But today’s most widely used climate simulators often struggle:...

7 MIN READ

Jul 10, 2025

Accelerating Video Production and Customization with GliaCloud and NVIDIA Omniverse Libraries

The proliferation of generative AI video models, along with the new workflows these models have introduced, has significantly accelerated production efficiency...

4 MIN READ

Jul 09, 2025

Reinforcement Learning with NVIDIA NeMo-RL: Reproducing a DeepScaleR Recipe Using GRPO

Reinforcement learning (RL) is the backbone of interactive AI. It is fundamental for teaching agents to reason and learn from human preferences, enabling...

5 MIN READ