Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy
This post was originally published August 21, 2024 but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading...
Rapidly Triage Container Security with the Vulnerability Analysis NVIDIA NIM Agent Blueprint
Addressing software security issues is becoming more challenging as the number of vulnerabilities reported in the CVE database continues to grow at an...
Accelerate Large Linear Programming Problems with NVIDIA cuOpt
The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...
NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library
Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...
Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
Rapidly Triage Container Security with the Vulnerability Analysis NVIDIA NIM Agent Blueprint
Addressing software security issues is becoming more challenging as the number of vulnerabilities reported in the CVE database continues to grow at an...
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
New Reward Model Helps Improve LLM Alignment with Human Preferences
Reinforcement learning from human feedback (RLHF) is essential for developing AI systems that are aligned with human values and preferences. RLHF enables the...
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
Evolving AI-Powered Game Development with Retrieval-Augmented Generation
Game development is a complex and resource-intensive process, particularly when using advanced tools like Unreal Engine. Developers find themselves navigating...
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy
This post was originally published August 21, 2024 but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading...
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Originally published on July 29, 2024, this post was updated on October 8, 2024. Robots need to be adaptable, readily learning new skills and adjusting to their...
Power Text-Generation Applications with Mistral NeMo 12B Running on a Single GPU
NVIDIA collaborated with Mistral to co-build the next-generation language model that achieves leading performance across benchmarks in its class. With a growing...
In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...
Addressing Medical Imaging Limitations with Synthetic Data Generation
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks
Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.
BGE-M3: Advanced Multilingual Text Retrieval Model
Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
AI techniques like large language models (LLMs) are rapidly transforming many scientific disciplines. Quantum computing is no exception. A collaboration between...
Spotlight: Montai Builds a Multimodal AI Platform for Drug Discovery Using NVIDIA NIM Microservices
Drug discovery aims to develop new therapeutic agents that effectively target diseases while minimizing side effects for patients. Using multimodal data—such...
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
Spotlight: Petrobras Speeds Up Linear Solvers for Reservoir Simulation Using NVIDIA Grace CPU
Reservoir simulation helps reservoir engineers optimize their resource exploration approach by simulating complex scenarios and comparing with real-world field...
New AI-Powered 3D Printing Can Help Surgeons Rehearse Procedures
Researchers at Washington State University (WSU) unveiled a new AI-guided 3D printing technique that can help physicians print intricate replicas of human...
Spotlight: SLB and NVIDIA Collaborate on Generative AI Solutions for Energy
Global energy technology company SLB has announced the next milestone in its long-standing collaboration with NVIDIA to develop and scale generative AI...
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
Using Generative AI to Enable Robots to Reason and Act with ReMEmbR
Vision-language models (VLMs) combine the powerful language understanding of foundational LLMs with the vision capabilities of vision transformers (ViTs) by...
Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking
This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
Build VLM-Powered Visual AI Agents Using NVIDIA NIM and NVIDIA VIA Microservices
Traditional video analytics applications and their development workflow are typically built on fixed-function, limited models that are designed to detect and...
Fast-Track Robot Learning in Simulation Using NVIDIA Isaac Lab
Originally published on July 29, 2024, this post was updated on October 8, 2024. Robots need to be adaptable, readily learning new skills and adjusting to their...
Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries
Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...
Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus
The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...
Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data
Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...
AI-Enhanced Navigation Charts Safer Waters for Massive Ships
Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab
Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...
Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs
Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot?that gives detailed...
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
High-Tech AI Framework Transforms Global Marine Pollution Tracking
An AI-powered remote sensing study offers a dynamic new tool for global ocean cleanup efforts. Detailed in the ISPRS Journal of Photogrammetry and Remote...
AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments
A recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
Accelerate Large Linear Programming Problems with NVIDIA cuOpt
The evolution of linear programming (LP) solvers has been marked by significant milestones over the past century, from Simplex to the interior point method...
NVIDIA CUDA-X Now Accelerates the Polars Data Processing Library
Polars, one of the fastest-growing data analytics tools, has just crossed 9M monthly downloads. As a modern DataFrame library, it is designed for efficiently...
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues
Antarctica plays a crucial role in regulating ?Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds
With the rapid expansion of language models over the past 18 months, hundreds of variants are now available. These include large language models (LLMs), small...
AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases
A groundbreaking drug-repurposing AI model could bring new hope to doctors and patients trying to treat diseases with limited or no existing treatment options....
AI Chatbot Delivers Multilingual Support to African Farmers
Some of Africa’s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot?that gives detailed...
Harnessing Data with AI to Boost Zero Trust Cyber Defense
Modern cyber threats have grown increasingly sophisticated, posing significant risks to federal agencies and critical infrastructure. According to Deloitte,...
Producing Cinematic Content at Scale with a Generative AI-Enabled OpenUSD Pipeline
Producing commercials is resource-intensive, requiring physical locations and various props and setups to display products in different settings and...
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate...
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
Simplify and Scale AI-Powered MetaHuman Deployment with NVIDIA ACE and Unreal Engine 5
At Unreal Fest 2024, NVIDIA released new Unreal Engine 5 on-device plugins for NVIDIA ACE, making it easier to build and deploy AI-powered MetaHuman characters...
Orchestrating Innovation at Scale with NVIDIA Maxine and Texel
The NVIDIA Maxine AI developer platform is a suite of NVIDIA NIM microservices, cloud-accelerated microservices, and SDKs that offer state-of-the-art features...
Enabling Customizable GPU-Accelerated Video Transcoding Pipelines
Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
Transform Live Media Pipelines with NVIDIA Holoscan for Media
NVIDIA Holoscan for Media is now ready to be used in live production, taking advantage of the best of both networking and GPU technologies. Holoscan for...
Fast Inversion for Real-Time Image Editing with Text
Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
Deploy the First On-Device Small Language Model for Improved Game Character Roleplay
At Gamescom 2024, NVIDIA announced our first on-device small language model (SLM) for improving the conversation abilities of game characters. We also announced...
Elevating Video Communication with the NVIDIA Maxine AI Developer Platform and VideoRequest
Effective video communication is important for everyone who communicates online. For businesses, educators, and content creators, it is vital. NVIDIA Maxine is...
Shader Debugging Made Easy with NVIDIA Nsight Graphics
Shaders are specialized programs that run on the GPU that manipulate rays, pixels, vertices, and textures to achieve unique visual effects. With shaders, you...
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint
Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to...
Deploying Accelerated Llama 3.2 from the Edge to the Cloud
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an...
Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo
NVIDIA NeMo has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry, particularly those topping the Hugging...
Quickly Voice Your Apps with NVIDIA NIM Microservices for Speech and Translation
NVIDIA NIM, part of NVIDIA AI Enterprise, provides containers to self-host GPU-accelerated inferencing microservices for pretrained and customized AI models...
Optimizing Data Center Performance with AI Agents and the OODA Loop Strategy
For any data center, operating large, complex GPU clusters is not for the faint of heart! There is a tremendous amount of complexity. Cooling, power,...
Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer
As large language models (LLMs) are becoming even bigger, it is increasingly important to provide easy-to-use and efficient deployment paths because the cost of...
Achieving State-of-the-Art Zero-Shot Waveform Audio Generation across Audio Types
Stunning audio content is an essential component of virtual worlds. Audio generative AI plays a key role in creating this content, and NVIDIA is continuously...
Deploy Diverse AI Apps with Multi-LoRA Support on RTX AI PCs and Workstations
Today’s large language models (LLMs) achieve unprecedented results across many use cases. Yet, application developers often need to customize and tune these...
The advent of large language models (LLMs) has significantly benefited the AI industry, offering versatile tools capable of generating human-like text and...
Practical Strategies for Optimizing LLM Inference Sizing and Performance
As the use of large language models (LLMs) grows across many applications, such as chatbots and content creation, it's important to understand the process of...
Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan
Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues
Antarctica plays a crucial role in regulating ?Earth’s climate. Most climate research into the world’s coldest, most windswept continent focuses on the...
How AI and Robotics are Driving Agricultural Productivity and Sustainability
By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
Using Generative AI to Enable Robots to Reason and Act with ReMEmbR
Vision-language models (VLMs) combine the powerful language understanding of foundational LLMs with the vision capabilities of vision transformers (ViTs) by...
AI Tool Helps Farmers Combat Crop Loss and Climate Change
Machine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
New Foundational Models and Training Capabilities with NVIDIA TAO 5.5
NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
Profit and Loss Modeling on GPUs with ISO C++ Language Parallelism
The previous post How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism demonstrated how to write a Black-Scholes simulation using ISO C++...
Spotlight: HP 3D Printing Open Sources AI Surrogates for Additive Manufacturing Using NVIDIA Modulus
An open ecosystem for physics-informed machine learning (physics-ML) fosters innovation and AI engineering applications. Physics-ML embeds into the learning...
Mistral-NeMo-Minitron 8B Model Delivers Unparalleled Accuracy
This post was originally published August 21, 2024 but has been revised with current data. Recently, NVIDIA and Mistral AI unveiled Mistral NeMo 12B, a leading...
Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that...
Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries
Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN
Gaming has always pushed the boundaries of graphics hardware. The most popular games typically required robust GPU, CPU, and RAM resources on a user’s PC or...
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator
Developers have shown a lot of excitement for NVIDIA NIM microservices, a set of easy-to-use cloud-native microservices that shortens the time-to-market and...
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance
Many of the most exciting applications of large language models (LLMs), such as interactive speech bots, coding co-pilots, and search, need to begin responding...
Developing Next-Generation Wireless Networks with NVIDIA Aerial Omniverse Digital Twin
The journey to 6G has begun, offering opportunities to deliver a network infrastructure that is performant, efficient, resilient, and adaptable. 6G networks...
Spotlight: Petrobras Speeds Up Linear Solvers for Reservoir Simulation Using NVIDIA Grace CPU
Reservoir simulation helps reservoir engineers optimize their resource exploration approach by simulating complex scenarios and comparing with real-world field...
Spotlight: SLB and NVIDIA Collaborate on Generative AI Solutions for Energy
Global energy technology company SLB has announced the next milestone in its long-standing collaboration with NVIDIA to develop and scale generative AI...
Accelerating Oracle Database Generative AI Workloads with NVIDIA NIM and NVIDIA cuVS
The vast majority of the world's data remains untapped, and enterprises are looking to generate value from this data by creating the next wave of generative AI...