featured – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-05-23T19:27:29Z http://www.open-lab.net/blog/feed/ Peter Noell <![CDATA[Spotlight: Infleqtion Optimizes Portfolios Using Q-CHOP and NVIDIA CUDA-Q Dynamics]]> http://www.open-lab.net/blog/?p=100657 2025-05-22T19:52:13Z 2025-05-22T19:30:38Z Computing is an essential tool for the modern financial services industry. Profits are won and lost based on the speed and accuracy of algorithms guiding...]]> Computing is an essential tool for the modern financial services industry. Profits are won and lost based on the speed and accuracy of algorithms guiding...financial chart

Computing is an essential tool for the modern financial services industry. Profits are won and lost based on the speed and accuracy of algorithms guiding financial decision making. Accelerated quantum computing has the potential to impact the financial services industry with new algorithms able to speed-up or enhance existing tools, such as portfolio optimization techniques.

Source

]]>
0
Graham Lopez <![CDATA[Just Released: NVIDIA HPC SDK v25.5]]> http://www.open-lab.net/blog/?p=100636 2025-05-21T16:33:53Z 2025-05-21T16:32:58Z The new release includes support for CUDA 12.9, updated library components, and performance improvements.]]> The new release includes support for CUDA 12.9, updated library components, and performance improvements.

The new release includes support for CUDA 12.9, updated library components, and performance improvements.

Source

]]>
0
Rachel Ho <![CDATA[Just Announced: Join the Google Cloud & NVIDIA Developer Community]]> http://www.open-lab.net/blog/?p=100576 2025-05-20T21:06:12Z 2025-05-20T20:30:00Z Master AI with Google Cloud & NVIDIA. Access an exclusive community, resources, and rewards.]]> Master AI with Google Cloud & NVIDIA. Access an exclusive community, resources, and rewards.image to announce the collaboration

Master AI with Google Cloud & NVIDIA. Access an exclusive community, resources, and rewards.

Source

]]>
0
Gunjan Mehta <![CDATA[NVIDIA TensorRT for RTX Introduces an Optimized Inference AI Library on Windows 11]]> http://www.open-lab.net/blog/?p=100333 2025-05-19T02:36:15Z 2025-05-19T16:00:00Z AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...]]> AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference...Decorative image.

AI experiences are rapidly expanding on Windows in creativity, gaming, and productivity apps. There are various frameworks available to accelerate AI inference in these apps locally on a desktop, laptop, or workstation. Developers need to navigate a broad ecosystem. They must choose between hardware-specific libraries for maximum performance, or cross-vendor frameworks like DirectML��

Source

]]>
0
Jaya Venkatesh <![CDATA[Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled]]> http://www.open-lab.net/blog/?p=100059 2025-05-15T19:07:18Z 2025-05-15T18:31:36Z Imagine analyzing millions of NYC ride-share journeys��tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...]]> Imagine analyzing millions of NYC ride-share journeys��tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...An image of NYC taxis.

Imagine analyzing millions of NYC ride-share journeys��tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup locations. The publicly available New York City Taxi and Limousine Commission (TLC) Trip Record Data contains valuable information that could reveal game-changing insights, but traditional processing approaches leave analysts waiting hours for results��

Source

]]>
0
Matt Ahrens <![CDATA[Predicting Performance on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=100118 2025-05-15T19:07:19Z 2025-05-15T17:00:00Z The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...]]> The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...

The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform for scale-out analytics, handling massive datasets for ETL, machine learning, and deep learning workloads. While traditionally CPU-based, the advent of GPU acceleration offers a compelling promise: significant speedups for data processing��

Source

]]>
0
Louis Bavoil <![CDATA[Path Tracing Optimizations in Indiana Jones?: Opacity MicroMaps and Compaction of Dynamic BLASs]]> http://www.open-lab.net/blog/?p=98909 2025-05-15T19:07:20Z 2025-05-15T15:30:00Z The first post in this series, Path Tracing Optimization in Indiana Jones?: Shader Execution Reordering and Live State Reductions, covered ray-gen shader...]]> The first post in this series, Path Tracing Optimization in Indiana Jones?: Shader Execution Reordering and Live State Reductions, covered ray-gen shader...A still from the game, Indiana Jones and the Great Circle.

The first post in this series, Path Tracing Optimization in Indiana Jones?: Shader Execution Reordering and Live State Reductions, covered ray-gen shader level optimizations that sped up the main path-tracing pass (��TraceMain��) of Indiana Jones and the Great Circle?. This second blog post covers additional GPU optimizations that were made at the level of the ray-tracing acceleration��

Source

]]>
0
Louis Bavoil <![CDATA[Path Tracing Optimization in Indiana Jones?: Shader Execution Reordering and Live State Reductions]]> http://www.open-lab.net/blog/?p=98587 2025-05-15T19:07:21Z 2025-05-15T15:30:00Z This post is part of the Path Tracing Optimizations in Indiana Jones? series.   While adding a path-tracing mode to Indiana Jones and the Great Circle?...]]> This post is part of the Path Tracing Optimizations in Indiana Jones? series.   While adding a path-tracing mode to Indiana Jones and the Great Circle?...

This post is part of the Path Tracing Optimizations in Indiana Jones series. While adding a path-tracing mode to Indiana Jones and the Great Circle in 2024, we used Shader Execution Reordering (SER), a feature available on NVIDIA GPUs since the NVIDIA GeForce RTX 40 Series, to improve the GPU performance. To optimize the use of SER in the main path-tracing pass (), we used the NVIDIA��

Source

]]>
0
Elias Wolfberg <![CDATA[AI Helps Uncover Potential Alzheimer��s Cause and Treatment]]> http://www.open-lab.net/blog/?p=100058 2025-05-15T19:07:22Z 2025-05-15T15:00:30Z A gene that can be an early indicator for Alzheimer��s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of...]]> A gene that can be an early indicator for Alzheimer��s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of...

A gene that can be an early indicator for Alzheimer��s disease actually is a cause of the degenerative-brain disorder, said researchers at the University of California, San Diego. That finding, which they discovered using AI, could result in new treatment options. In a paper published in April in the scientific journal Cell, a team at UCSD found that the gene PHGDH��previously considered a��

Source

]]>
0
Michael Anderson <![CDATA[Accelerating Embedding Lookups with cuEmbed]]> http://www.open-lab.net/blog/?p=96714 2025-05-15T19:07:23Z 2025-05-15T15:00:00Z NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...]]> NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you're building...A drawing of a person holding a phone, with a callout of the phone screen and chat bubbles.

NVIDIA recently released cuEmbed, a high-performance, header-only CUDA library that accelerates embedding lookups on NVIDIA GPUs. If you��re building recommendation systems, embedding operations are likely consuming significant computational resources. Embedding lookups present a unique optimization challenge. They��re memory-intensive operations with irregular access patterns.

Source

]]>
0
Vinh Nguyen <![CDATA[Build Custom Reasoning Models with Advanced, Open Post-Training Datasets]]> http://www.open-lab.net/blog/?p=98680 2025-05-15T19:07:23Z 2025-05-14T16:33:26Z Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...]]> Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from...How the Llama-Nemotron 30M Post Training Dataset was created

Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from either a single or cohort of open-source, commercially permissible LLMs, a base LLM is finetuned either with supervised finetuning or RLHF to gain instruction-following and reasoning skills. This process can be seen as a knowledge��

Source

]]>
0
Berkin Kartal <![CDATA[AI Fabric Resiliency and Why Network Convergence Matters]]> http://www.open-lab.net/blog/?p=98574 2025-05-15T19:07:25Z 2025-05-14T16:20:00Z High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication...]]> High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication...Typical data center interconnection schema for Clos fabric.

High-performance computing and deep learning workloads are extremely sensitive to latency. Packet loss forces retransmission or stalls in the communication pipeline, which directly increases latency and disrupts the synchronization between GPUs. This can degrade the performance of collective operations such as all-reduce or broadcast, where every GPU��s participation is required before progressing.

Source

]]>
0
Brad Nemire <![CDATA[Get Trained and Certified at GTC Paris at VivaTech 2025]]> http://www.open-lab.net/blog/?p=100034 2025-05-15T19:07:25Z 2025-05-14T16:16:06Z Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.]]> Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.image of the paris skyline

Join us at GTC Paris on June 10th and choose from six full-day, instructor-led workshops.

Source

]]>
0
Gunjan Mehta <![CDATA[NVIDIA TensorRT Unlocks FP4 Image Generation ?for NVIDIA Blackwell GeForce RTX 50 Series GPUs]]> http://www.open-lab.net/blog/?p=99256 2025-05-15T19:07:26Z 2025-05-14T15:05:11Z The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX...]]> The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX...Four tiles of city scenes.

The launch of the NVIDIA Blackwell platform ushered in a new era of improvements in generative AI technology. At its forefront is the newly launched GeForce RTX 50 series GPUs for PCs and workstations that boast fifth-generation Tensor Cores with 4-bit floating point compute (FP4)��a must-have for accelerating advanced generative AI models like FLUX from Black Forest Labs. As the latest image��

Source

]]>
0
Sophia Schuur <![CDATA[Connect Simulations with the Real World Using NVIDIA Air Services]]> http://www.open-lab.net/blog/?p=99778 2025-05-15T19:07:28Z 2025-05-13T18:00:00Z NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. With NVIDIA Air, you can spin up...]]> NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. With NVIDIA Air, you can spin up...

NVIDIA Air enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. With NVIDIA Air, you can spin up hundreds of switches and servers and configure them with a single script. One of the many advantages of NVIDIA Air is the ability to connect your simulations with the real world. Enabling an external connection in your environment can��

Source

]]>
0
Brad Nemire <![CDATA[Just Released: NVIDIA Warp is Now Open-Source Under Apache 2.0]]> http://www.open-lab.net/blog/?p=99970 2025-05-15T19:07:28Z 2025-05-12T18:41:04Z NVIDIA Warp, a simulation computing framework, is now accessible to all developers.]]> NVIDIA Warp, a simulation computing framework, is now accessible to all developers.

NVIDIA Warp, a simulation computing framework, is now accessible to all developers.

Source

]]>
0
Alex Zeltov <![CDATA[Accelerated AI Inference with NVIDIA NIM on Azure AI Foundry]]> http://www.open-lab.net/blog/?p=99911 2025-05-15T19:07:29Z 2025-05-12T17:59:36Z The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...]]> The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with...

The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with Azure��s scalable, secure infrastructure, organizations can now deploy powerful, ready-to-use AI models more efficiently than ever before. NIM microservices are containerized for GPU-accelerated inferencing for pretrained and customized��

Source

]]>
0
Shashank Verma <![CDATA[Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework]]> http://www.open-lab.net/blog/?p=99933 2025-05-15T19:07:31Z 2025-05-12T17:48:24Z As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...]]> As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...

As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By using state-of-the-art models on Day-0, teams can harness these innovations efficiently, maintain relevance, and be competitive. The past year has seen a flurry of exciting model series releases in the open-source community��

Source

]]>
0
Jaydeep Marathe <![CDATA[CUDA C++ Compiler Updates Impacting ELF Visibility and Linkage]]> http://www.open-lab.net/blog/?p=99693 2025-05-15T19:07:32Z 2025-05-09T16:51:02Z In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...]]> In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...

Source

]]>
0
Rucha Apte <![CDATA[Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research]]> http://www.open-lab.net/blog/?p=99794 2025-05-15T19:07:33Z 2025-05-09T16:00:00Z Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...]]> Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates...An illustration showing molecules and a brain.

Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates per day. In this blog post, we explore how domain-adapted large language models (LLMs), enhanced with reasoning capabilities, are transforming scientific research, especially in high-stakes, complex domains like battery innovation.

Source

]]>
0
Dhruv Nandakumar <![CDATA[Applying Autoencoder-Based GNNs for High-Throughput Network Anomaly Detection in NetFlow Data]]> http://www.open-lab.net/blog/?p=99171 2025-05-15T19:07:34Z 2025-05-08T22:18:41Z As modern enterprise and cloud environments scale, the complexity and volume of network traffic increase dramatically. NetFlow is used to record metadata about...]]> As modern enterprise and cloud environments scale, the complexity and volume of network traffic increase dramatically. NetFlow is used to record metadata about...cybersecurity image

As modern enterprise and cloud environments scale, the complexity and volume of network traffic increase dramatically. NetFlow is used to record metadata about the traffic flows traversing a network device such as a router, switch, or host. NetFlow data, essential for understanding network traffic, can be effectively modeled as graphs where edges capture properties such as connection duration and��

Source

]]>
1
Wenqi Glantz <![CDATA[Extending the NVIDIA Agent Intelligence Toolkit to Support New Agentic Frameworks]]> http://www.open-lab.net/blog/?p=99799 2025-05-15T19:07:35Z 2025-05-08T18:30:00Z NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...]]> NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to...

NVIDIA Agent Intelligence toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. It focuses on enabling developers to quickly build, evaluate, profile, and accelerate complex agentic AI workflows?��?systems in which multiple AI agents collaborate to perform tasks. The Agent Intelligence toolkit acts as a unifying framework that integrates existing��

Source

]]>
0
Kyle Aubrey <![CDATA[Turbocharge LLM Training Across Long-Haul Data Center Networks with NVIDIA Nemo Framework]]> http://www.open-lab.net/blog/?p=99764 2025-05-15T19:07:37Z 2025-05-08T18:28:58Z Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...]]> Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for...A multi-data center illustration.

Multi-data center training is becoming essential for AI factories as pretraining scaling fuels the creation of even larger models, leading the demand for computing performance to outpace the capabilities of a single facility. By distributing workloads across multiple data centers, organizations can overcome limitations in power, cooling, and space, enabling the training of even larger��

Source

]]>
0
Ruilong Li <![CDATA[Revolutionizing Neural Reconstruction and Rendering in gsplat with 3DGUT]]> http://www.open-lab.net/blog/?p=99680 2025-05-15T19:07:38Z 2025-05-08T16:09:03Z Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....]]> Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins....

Realistic 3D simulation is becoming a cornerstone of modern AI and graphics, from training autonomous vehicles (AV) to powering robotics and digital twins. Neural rendering techniques like NeRFs and 3D Gaussian Splatting (3DGS) have revolutionized how 3D scenes are reconstructed and visualized from raw sensor data. In this post, we introduce the implementation of 3D Gaussian Unscented��

Source

]]>
0
Rishi Chandra <![CDATA[Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud]]> http://www.open-lab.net/blog/?p=99585 2025-05-15T19:07:40Z 2025-05-08T16:00:00Z Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data��documents, emails,...]]> Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data��documents, emails,...

Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data��documents, emails, multimedia content��deep learning (DL) and large language models (LLMs) have become core components of the modern data analytics pipeline. These models enable a variety of downstream tasks, such as image captioning, semantic tagging��

Source

]]>
0
Kang Xu <![CDATA[Spotlight: Accelerating the Discovery of New Battery Materials with SES AI��s Molecular Universe]]> http://www.open-lab.net/blog/?p=99608 2025-05-15T19:07:41Z 2025-05-08T15:00:00Z From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress...]]> From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress...

From the Stone Age to the digital era, materials have been the foundation of our civilization across all epochs. Today, finding new materials leads to progress in energy, medicine, and advancements in technology. This creates a future of endless possibilities, however, there are still challenges. Human-powered approaches to finding new materials have been slow, costly, unexpected, and limited to a��

Source

]]>
0
Camden Spehl <![CDATA[Concept?Driven AI Teaching Assistant Guides Students to Deeper Insights]]> http://www.open-lab.net/blog/?p=99719 2025-05-15T19:07:42Z 2025-05-07T20:57:51Z In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...]]> In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information,...Students sitting around a computer.

In today��s educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information, they��ve also created new concerns about academic integrity. Increasingly, students rely on AI to generate direct answers to homework questions, often at the expense of developing critical thinking skills and mastering core concepts.

Source

]]>
1
Nirmal Kumar Juluru <![CDATA[Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=99540 2025-05-15T19:07:43Z 2025-05-07T16:22:31Z Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...]]> Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...

Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable developers to build highly accurate LLMs, NVIDIA previously released Nemotron-CC, a 6.3-trillion-token English language Common Crawl (CC) dataset. Today, the NVIDIA NeMo Curator team is excited to share that the pipeline used to build the��

Source

]]>
0
Justine Lin <![CDATA[Using Python to Automate 3D Workflows with OpenUSD?]]> http://www.open-lab.net/blog/?p=99493 2025-05-15T19:07:44Z 2025-05-07T16:00:00Z Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex...]]> Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex...

Universal Scene Description (OpenUSD) offers a powerful, open, and extensible ecosystem for describing, composing, simulating, and collaborating within complex 3D worlds. From handling massive datasets and automating workflows for digital twins to enabling real-time rendering for games and streamlining industrial operations in manufacturing and energy, it is transforming how industries work with��

Source

]]>
0
Vinh Nguyen <![CDATA[LLM Inference Benchmarking Guide: NVIDIA GenAI-Perf and NIM]]> http://www.open-lab.net/blog/?p=99180 2025-05-15T19:07:45Z 2025-05-06T17:35:39Z This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...]]> This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...Decorative image of a datacenter with floating icons overlaid.

This is the second post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. When building LLM-based applications, it is critical to understand the performance characteristics of these models on a given hardware. This serves multiple purposes: As a client-side LLM-focused benchmarking tool��

Source

]]>
0
Weiji Chen <![CDATA[New NVIDIA NV-Tesseract Time Series Models Advance Dataset Processing and Anomaly Detection]]> http://www.open-lab.net/blog/?p=99642 2025-05-15T19:07:47Z 2025-05-06T16:22:57Z Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it��s streamlining...]]> Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it��s streamlining...

Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it��s streamlining logistics, forecasting markets, or anticipating machine failures, organizations need more sophisticated tools than traditional methods can offer. NVIDIA GPU-accelerated deep learning is enabling industries to gain real-time analytics.

Source

]]>
0
Greg Jones <![CDATA[Powering Next-Gen XR Design at Rivian with NVIDIA RTX PRO Blackwell Desktop GPUs]]> http://www.open-lab.net/blog/?p=99070 2025-05-15T19:07:48Z 2025-05-06T16:00:00Z For professionals pushing the boundaries of XR, creating the most immersive and highest fidelity experiences is always challenging. Demanding XR workflows push...]]> For professionals pushing the boundaries of XR, creating the most immersive and highest fidelity experiences is always challenging. Demanding XR workflows push...Image of someone using a VR headset driving a simular

For professionals pushing the boundaries of XR, creating the most immersive and highest fidelity experiences is always challenging. Demanding XR workflows push the performance limits when rendering massive datasets and driving the latest ultra-high-resolution advanced XR headsets. Simultaneously integrating advanced artificial intelligence capabilities for more interactive and intuitive��

Source

]]>
0
Jonathan Bentz <![CDATA[Just Released: CUDA 12.9]]> http://www.open-lab.net/blog/?p=99599 2025-05-15T19:07:49Z 2025-05-05T15:39:54Z New features include enhancements to confidential computing and family-specific features and targets supported by NVCC.?]]> New features include enhancements to confidential computing and family-specific features and targets supported by NVCC.?

New features include enhancements to confidential computing and family-specific features and targets supported by NVCC.

Source

]]>
0
Ankit Patel <![CDATA[Integrate and Deploy Tongyi Qwen3 Models into Production Applications with NVIDIA]]> http://www.open-lab.net/blog/?p=99462 2025-05-15T19:07:50Z 2025-05-02T22:00:00Z Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models,...]]> Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models,...

Alibaba recently released Tongyi Qwen3, a family of open-source hybrid-reasoning large language models (LLMs). The Qwen3 family consists of two MoE models, 235B-A22B (235B total parameters and 22B active parameters) and 30B-A3B, and six dense models, including the 0.6B, 1.7B, 4B, 8B, 14B, 32B versions. With ultra-fast token generation, developers can efficiently integrate and deploy Qwen3��

Source

]]>
0
Mark Harris <![CDATA[An Even Easier Introduction to CUDA (Updated)]]> http://www.open-lab.net/blog/parallelforall/?p=7501 2025-05-19T16:20:29Z 2025-05-02T17:31:00Z Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...]]> Note: This blog post was originally published on Jan 25, 2017, but has been edited to reflect new updates. This post is a super simple introduction to CUDA, the...

Source

]]>
141
Brad Nemire <![CDATA[HackAI Challenge Winners Announced]]> http://www.open-lab.net/blog/?p=99563 2025-05-15T19:08:27Z 2025-05-02T16:31:11Z Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.]]> Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.

Explore the groundbreaking projects and real-world impacts of the HackAI Challenge powered by NVIDIA AI Workbench and Dell Precision.

Source

]]>
0
Jonathan Bentz <![CDATA[NVIDIA Blackwell and NVIDIA CUDA 12.9 Introduce Family-Specific Architecture Features]]> http://www.open-lab.net/blog/?p=98753 2025-05-15T19:08:27Z 2025-05-01T22:39:39Z One of the earliest architectural design decisions that went into the CUDA platform for NVIDIA GPUs was support for backward compatibility of GPU code. This...]]> One of the earliest architectural design decisions that went into the CUDA platform for NVIDIA GPUs was support for backward compatibility of GPU code. This...

Source

]]>
0
Babak Hejazi <![CDATA[Boosting Matrix Multiplication Speed and Flexibility with NVIDIA cuBLAS 12.9]]> http://www.open-lab.net/blog/?p=99184 2025-05-15T19:08:28Z 2025-05-01T20:00:00Z The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more.   Two...]]> The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more.   Two...An image representing matrix multiplication.

The NVIDIA CUDA-X math libraries empower developers to build accelerated applications for AI, scientific computing, data processing, and more. Two of the most important applications of CUDA-X libraries are training and inference LLMs, whether for use in everyday consumer applications or highly specialized scientific domains like drug discovery. Multiple CUDA-X libraries are indispensable��

Source

]]>
0
Allison Ding <![CDATA[Stacking Generalization with HPO: Maximize Accuracy in 15 Minutes with NVIDIA cuML]]> http://www.open-lab.net/blog/?p=99417 2025-05-15T19:08:30Z 2025-05-01T18:35:18Z Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...]]> Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...

Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive performance. On the other hand, hyperparameter optimization (HPO) involves systematically searching for the best set of hyperparameters to maximize the performance of a given ML algorithm. A common challenge when using both stacking and HPO��

Source

]]>
0
Jonathan Bikoff <![CDATA[Spotlight: Personal AI Brings AI Receptionists to Small Business Owners with NVIDIA Riva]]> http://www.open-lab.net/blog/?p=99402 2025-05-16T23:50:38Z 2025-04-29T22:44:07Z It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...]]> It's 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the...

It��s 10 p.m. on a Tuesday when the phone rings at the Sapochnick Law Firm, a specialized law practice in San Diego, California. The caller, a client of the firm, is anxious as the phone rings. They received an important letter containing? potentially life-changing news, and had urgent questions for their lawyer. The client quickly realizes the Sapochnick team likely left the office hours ago��

Source

]]>
1
Joseph Lucas <![CDATA[Structuring Applications to Secure the KV Cache]]> http://www.open-lab.net/blog/?p=99425 2025-05-15T19:08:32Z 2025-04-29T22:43:01Z When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...]]> When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the...

When interacting with transformer-based models like large language models (LLMs) and vision-language models (VLMs), the structure of the input shapes the model��s output. But prompts are often more than a simple user query. In practice, they optimize the response by dynamically assembling data from various sources such as system instructions, context data, and user input.

Source

]]>
0
Jenn Yonemitsu <![CDATA[Kaggle Grandmasters Unveil Winning Strategies for Data Science Superpowers]]> http://www.open-lab.net/blog/?p=99350 2025-05-15T19:08:33Z 2025-04-29T17:22:59Z Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year��s Google Cloud Next...]]> Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year��s Google Cloud Next...A fireside chat with Kaggle Grandmasters.

Kaggle Grandmasters David Austin and Chris Deotte from NVIDIA and Ruchi Bhatia from HP joined Brenda Flynn from Kaggle at this year��s Google Cloud Next conference in Las Vegas. They shared a bit about who they are, what motivates them to compete, and how they contribute to and win competitions on the world��s largest data science competition platform. This blog post captures a glimpse of��

Source

]]>
0
Sama Bali <![CDATA[Choosing Your First Local AI Project?]]> http://www.open-lab.net/blog/?p=99361 2025-05-15T19:08:34Z 2025-04-29T17:00:00Z AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced...]]> AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced...An illustration representing generative AI.

AI is rapidly moving beyond centralized cloud and data centers, becoming a powerful tool deployable directly on professional workstations. Thanks to advanced hardware and optimized software, you can build, run, and experiment with sophisticated AI models at your desk or on the go. Welcome to the world of local AI development! Running and developing AI locally on a workstation offers��

Source

]]>
0
Meenakshi Kaushik <![CDATA[NVIDIA NIM Operator 2.0 Boosts AI Deployment with NVIDIA NeMo Microservices Support]]> http://www.open-lab.net/blog/?p=99309 2025-05-15T19:08:34Z 2025-04-29T16:00:00Z The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...]]> The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the...Decorative image.

The first release of NVIDIA NIM Operator simplified the deployment and lifecycle management of inference pipelines for NVIDIA NIM microservices, reducing the workload for MLOps, LLMOps engineers, and Kubernetes admins. It enabled easy and fast deployment, auto-scaling, and upgrading of NIM on Kubernetes clusters. Learn more about the first release. Our customers and partners have been using��

Source

]]>
0
Elias Wolfberg <![CDATA[How SETI Uses AI to Search for Intelligent Alien Life]]> http://www.open-lab.net/blog/?p=99382 2025-05-15T19:08:36Z 2025-04-28T22:57:24Z A researcher from the SETI Institute described to a packed audience at GTC 2025 how SETI had successfully trialed a novel method to identify interstellar radio...]]> A researcher from the SETI Institute described to a packed audience at GTC 2025 how SETI had successfully trialed a novel method to identify interstellar radio...

A researcher from the SETI Institute described to a packed audience at GTC 2025 how SETI had successfully trialed a novel method to identify interstellar radio waves which, theoretically, can also be used to identify communication from intelligent extraterrestrial life. Luigi Cruz, a staff engineer at SETI, the world��s foremost organization looking for signs of intelligent life on other��

Source

]]>
0
Hsin Chen <![CDATA[Advancing Cybersecurity Operations with Agentic AI Systems]]> http://www.open-lab.net/blog/?p=99329 2025-05-15T19:08:37Z 2025-04-28T19:52:53Z The age of passive AI is over. A new era is beginning, where AI doesn��t just respond��it thinks, plans, and acts. The rapid advancement of large language...]]> The age of passive AI is over. A new era is beginning, where AI doesn��t just respond��it thinks, plans, and acts. The rapid advancement of large language...

The age of passive AI is over. A new era is beginning, where AI doesn��t just respond��it thinks, plans, and acts. The rapid advancement of large language models (LLMs) has unlocked the potential of agentic AI systems, enabling the automation of tedious tasks across many fields, including cybersecurity. Traditionally, AI applications in cybersecurity have focused primarily on detecting��

Source

]]>
0
Asawaree Bhide <![CDATA[R2D2: Adapting Dexterous Robots with NVIDIA Research Workflows and Models]]> http://www.open-lab.net/blog/?p=98754 2025-05-15T19:08:38Z 2025-04-25T16:00:00Z Robotic arms are used today for assembly, packaging, inspection, and many more applications. However, they are still preprogrammed to perform specific and often...]]> Robotic arms are used today for assembly, packaging, inspection, and many more applications. However, they are still preprogrammed to perform specific and often...

Robotic arms are used today for assembly, packaging, inspection, and many more applications. However, they are still preprogrammed to perform specific and often repetitive tasks. To meet the increasing need for adaptability in most environments, perceptive arms are needed to make decisions and adjust behavior based on real-time data. This leads to more flexibility across tasks in collaborative��

Source

]]>
0
Dylan Lacewell <![CDATA[Fast Ray Tracing of Dynamic Scenes Using NVIDIA OptiX 9 and NVIDIA RTX Mega Geometry]]> http://www.open-lab.net/blog/?p=99130 2025-05-15T19:08:39Z 2025-04-24T19:30:00Z Real-time ray tracing is a powerful rendering technique that can create incredibly realistic images. NVIDIA OptiX and RTX technology make this possible, even...]]> Real-time ray tracing is a powerful rendering technique that can create incredibly realistic images. NVIDIA OptiX and RTX technology make this possible, even...

Real-time ray tracing is a powerful rendering technique that can create incredibly realistic images. NVIDIA OptiX and RTX technology make this possible, even for scenes with a massive amount of detail. However, when these detailed scenes involve movement and animation, maintaining real-time ray tracing performance can be challenging. This post explores how the new RTX Mega Geometry features��

Source

]]>
0
Davide Paglieri <![CDATA[Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=99202 2025-05-15T19:08:40Z 2025-04-24T17:00:00Z This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...]]> This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM. Researchers from the University College London (UCL) Deciding, Acting, and Reasoning with Knowledge (DARK) Lab leverage NVIDIA NIM microservices in their new game-based benchmark suite, Benchmarking Agentic LLM and VLM Reasoning On Games��

Source

]]>
0
Amit Bleiweiss <![CDATA[Spotlight: Qodo Innovates Efficient Code Search with NVIDIA DGX]]> http://www.open-lab.net/blog/?p=99041 2025-05-15T19:08:41Z 2025-04-23T22:23:32Z Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...]]> Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...Decorative image.

Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there are limitations that become apparent. Challenges such as understanding the nuances of programming languages, complex dependencies, and adapting to codebase-specific context can lead to lower-quality code and cause bottlenecks down the line.

Source

]]>
0
Emily Sakata <![CDATA[Announcing NVIDIA Secure AI General Availability]]> http://www.open-lab.net/blog/?p=99064 2025-05-15T19:08:42Z 2025-04-23T22:23:11Z As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...]]> As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models...

As many enterprises move to running AI training or inference on their data, the data and the code need to be protected, especially for large language models (LLMs). Many customers can��t risk placing their data in the cloud because of data sensitivity. Such data may contain personally identifiable information (PII) or company proprietary information, and the trained model has valuable intellectual��

Source

]]>
0
Jean-Eudes Marvie <![CDATA[Real-Time GPU-Accelerated Gaussian Splatting with NVIDIA DesignWorks Sample vk_gaussian_splatting]]> http://www.open-lab.net/blog/?p=98796 2025-05-15T19:08:43Z 2025-04-23T20:00:00Z Gaussian splatting is a novel approach to rendering complex 3D scenes by representing them as a collection of anisotropic Gaussians in 3D space. This technique...]]> Gaussian splatting is a novel approach to rendering complex 3D scenes by representing them as a collection of anisotropic Gaussians in 3D space. This technique...

Gaussian splatting is a novel approach to rendering complex 3D scenes by representing them as a collection of anisotropic Gaussians in 3D space. This technique enables real-time rendering of photorealistic scenes learned from small sets of images, making it ideal for applications in gaming, virtual reality, and real-time professional visualization. vk_gaussian_splatting is a new Vulkan-based��

Source

]]>
0
Bo Dong <![CDATA[NVIDIA cuPyNumeric 25.03 Now Fully Open Source with PIP and HDF5 Support]]> http://www.open-lab.net/blog/?p=99089 2025-05-15T19:08:44Z 2025-04-23T19:26:07Z NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...]]> NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings...

NVIDIA cuPyNumeric is a library that aims to provide a distributed and accelerated drop-in replacement for NumPy built on top of the Legate framework. It brings zero-code-change scaling to multi-GPU and multinode (MGMN) accelerated computing. cuPyNumeric 25.03 is a milestone update that introduces powerful new capabilities and enhanced accessibility for users and developers alike��

Source

]]>
0
Shashank Verma <![CDATA[Enhance Your AI Agent with Data Flywheels Using NVIDIA NeMo Microservices]]> http://www.open-lab.net/blog/?p=98721 2025-05-15T19:08:45Z 2025-04-23T13:00:00Z Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...]]> Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...

Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on agentic AI systems to optimize business processes, keeping these systems aligned with evolving business needs and new data becomes crucial. This post dives into how to build an iteration of a data flywheel using NVIDIA NeMo��

Source

]]>
0
Brad Nemire <![CDATA[NVIDIA GTC Training Labs Now Available On Demand]]> http://www.open-lab.net/blog/?p=99074 2025-05-15T19:08:47Z 2025-04-22T17:26:28Z Missed GTC? This year��s training labs are now available on demand to watch anywhere, anytime.]]> Missed GTC? This year��s training labs are now available on demand to watch anywhere, anytime.

Missed GTC? This year��s training labs are now available on demand to watch anywhere, anytime.

Source

]]>
0
Michelle Horton <![CDATA[AI for a Greener Future: Its Power is in Our Hands]]> http://www.open-lab.net/blog/?p=98969 2025-05-15T19:08:47Z 2025-04-22T16:00:00Z Can AI guide us toward a more sustainable future, or is it exacerbating global energy and climate challenges?  This critical question was recently posed to...]]> Can AI guide us toward a more sustainable future, or is it exacerbating global energy and climate challenges?  This critical question was recently posed to...

Can AI guide us toward a more sustainable future, or is it exacerbating global energy and climate challenges? This critical question was recently posed to a panel of sustainability and AI experts from Columbia University, Deloitte, and the Wilson Center at NVIDIA GTC 2025. In a packed room moderated by Josh Parker, senior director of Corporate Sustainability at NVIDIA��

Source

]]>
0
Maximilian M��ller <![CDATA[Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT]]> http://www.open-lab.net/blog/?p=98927 2025-05-15T19:08:48Z 2025-04-21T18:44:38Z State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant...]]> State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant...

State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, requiring significant computational resources and high costs. By leveraging the latest FP8 quantization features on NVIDIA Hopper GPUs with NVIDIA TensorRT, it��s possible to significantly reduce inference costs and serve more users with fewer GPUs.

Source

]]>
0
Elias Wolfberg <![CDATA[AI Inspires Artists and Industrialists to Reimagine their Crafts]]> http://www.open-lab.net/blog/?p=99010 2025-05-15T19:08:50Z 2025-04-21T18:14:32Z AI has become nearly synonymous with innovation. As it rushes onto the world stage, AI is seeding inspiration in creators and problem-solvers of all...]]> AI has become nearly synonymous with innovation. As it rushes onto the world stage, AI is seeding inspiration in creators and problem-solvers of all...A robot arm carving a sculpture.

AI has become nearly synonymous with innovation. As it rushes onto the world stage, AI is seeding inspiration in creators and problem-solvers of all stripes��from artists to more traditional industrial inventors. One of the world��s leading AI-first artists, Alexander Reben, has spent his career integrating AI into different artistic mediums. His current work explores AI and robotics and��

Source

]]>
0
Bartley Richardson https://www.linkedin.com/in/bartleyrichardson/%20 <![CDATA[Upcoming Event: NVIDIA Agent Toolkit Hackathon]]> http://www.open-lab.net/blog/?p=98965 2025-05-15T19:08:50Z 2025-04-18T17:06:38Z Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit -- contest runs May 12 to May 23.]]> Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit -- contest runs May 12 to May 23.

Build a high-performance agentic AI system using the open-source NVIDIA Agent Intelligence toolkit �� contest runs May 12 to May 23.

Source

]]>
0
Chris Deotte https://www.kaggle.com/cdeotte <![CDATA[Grandmaster Pro Tip: Winning First Place in Kaggle Competition with Feature Engineering Using cuDF pandas]]> http://www.open-lab.net/blog/?p=98938 2025-05-19T22:17:46Z 2025-04-17T23:03:20Z Feature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer...]]> Feature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer...

Feature engineering remains one of the most effective ways to improve model accuracy when working with tabular data. Unlike domains such as NLP and computer vision, where neural networks can extract rich patterns from raw inputs, the best-performing tabular models��particularly gradient-boosted decision trees��still gain a significant advantage from well-crafted features. However��

Source

]]>
0
James Bigler <![CDATA[Neural Rendering in NVIDIA OptiX Using Cooperative Vectors]]> http://www.open-lab.net/blog/?p=98814 2025-05-15T19:08:53Z 2025-04-17T17:00:00Z The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature...]]> The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature...

The release of NVIDIA OptiX 9.0 introduces a new feature called cooperative vectors that enables AI workflows as part of ray tracing kernels. The feature leverages NVIDIA RTX Tensor Cores for hardware-accelerated matrix operations and neural net computations during shading. This unlocks AI rendering techniques such as NVIDIA RTX Neural Shaders and NVIDIA RTX Neural Texture Compression (NTC) and��

Source

]]>
0
Elias Wolfberg <![CDATA[AI-Generated Heat Maps Keep Seniors and their Privacy Safe]]> http://www.open-lab.net/blog/?p=98891 2025-05-15T19:08:54Z 2025-04-16T20:00:10Z By 2030, more than one in five Americans will be 65 or older, becoming the United States�� largest group of seniors ever. Silicon Valley-based startup Butlr...]]> By 2030, more than one in five Americans will be 65 or older, becoming the United States�� largest group of seniors ever. Silicon Valley-based startup Butlr...A heatmap animated GIF.

By 2030, more than one in five Americans will be 65 or older, becoming the United States�� largest group of seniors ever. Silicon Valley-based startup Butlr has developed an AI platform designed to keep seniors safe while preserving their privacy. Their AI-based platform uses a neural network to interpret different temperature data that its sensors, which are strategically placed in��

Source

]]>
0
Daniel Rodriguez <![CDATA[Announcing ComputeEval, an Open Source Framework for Evaluating LLMs on CUDA]]> http://www.open-lab.net/blog/?p=98885 2025-05-21T19:07:26Z 2025-04-16T16:48:07Z Large language models (LLMs) are revolutionizing how developers code and how they learn to code. For seasoned or junior developers alike, today��s...]]> Large language models (LLMs) are revolutionizing how developers code and how they learn to code. For seasoned or junior developers alike, today��s...

Large language models (LLMs) are revolutionizing how developers code and how they learn to code. For seasoned or junior developers alike, today��s state-of-the-art models can generate Python scripts, React-based websites, and more. In the future, powerful AI models will assist developers in writing high-performance GPU code. This raises an important question: How can it be determined whether an LLM��

Source

]]>
0
Sebastian Haan <![CDATA[Developing an AI-Powered Tool for Automatic Citation Validation Using NVIDIA NIM]]> http://www.open-lab.net/blog/?p=98315 2025-05-15T19:08:56Z 2025-04-16T16:40:50Z The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can...]]> The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can...

The accuracy of citations is crucial for maintaining the integrity of both academic and AI-generated content. When citations are inaccurate or wrong, they can mislead readers and spread false information. As a team of researchers from the University of Sydney specializing in machine learning and AI, we are developing an AI-powered tool capable of efficiently cross-checking and analyzing semantic��

Source

]]>
0
Ziyue Xu <![CDATA[Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming]]> http://www.open-lab.net/blog/?p=98553 2025-05-01T18:35:52Z 2025-04-16T16:00:00Z Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....]]> Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy....Decorative image.

Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy. However, FL faces significant challenges related to communication overhead and local resource constraints when balancing model requirements and communication capabilities. Particularly in the current era of large language models��

Source

]]>
1
Nirmal Kumar Juluru <![CDATA[NVIDIA Llama Nemotron Ultra Open Model Delivers Groundbreaking Reasoning Accuracy]]> http://www.open-lab.net/blog/?p=98855 2025-05-05T22:33:12Z 2025-04-15T18:00:00Z AI is no longer just about generating text or images��it��s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...]]> AI is no longer just about generating text or images��it��s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world...

AI is no longer just about generating text or images��it��s about deep reasoning, detailed problem-solving, and powerful adaptability for real-world applications in business and in financial, customer, and healthcare services. Available today, the latest Llama Nemotron Ultra reasoning model from NVIDIA delivers leading accuracy among open-source models across intelligence and coding benchmarks��

Source

]]>
0
Tanya Lenz <![CDATA[Event: Data Filtering Challenge for Training Edge Language Models]]> http://www.open-lab.net/blog/?p=98542 2025-05-01T18:35:54Z 2025-04-15T15:00:00Z You��re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.]]> You��re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.

You��re invited to join the challenge. Develop and apply innovative data filtering techniques to curate datasets that enhance edge LM performance.

Source

]]>
0
Brad Nemire <![CDATA[Just Released: NVDIA Run:ai 2.21]]> http://www.open-lab.net/blog/?p=98795 2025-05-01T18:35:55Z 2025-04-14T19:27:48Z NVIDIA Run:ai 2.21 adds GB200 NVL72 support, rolling inference updates and smarter resource controls.]]> NVIDIA Run:ai 2.21 adds GB200 NVL72 support, rolling inference updates and smarter resource controls.

NVIDIA Run:ai 2.21 adds GB200 NVL72 support, rolling inference updates and smarter resource controls.

Source

]]>
0
Ziyue Xu <![CDATA[Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch]]> http://www.open-lab.net/blog/?p=98560 2025-05-01T18:35:55Z 2025-04-11T18:37:54Z NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...]]> NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the...Decorative image.

NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the integration of NVIDIA FLARE and ExecuTorch. NVIDIA FLARE is a domain-agnostic, open-source, extensible SDK that enables researchers and data scientists to adapt existing machine learning or deep learning workflows to a federated paradigm.

Source

]]>
1
Brian Sparks <![CDATA[NVIDIA Helps Build AI Factories Faster Than Ever with NVIDIA DGX SuperPOD]]> http://www.open-lab.net/blog/?p=98579 2025-04-17T19:35:28Z 2025-04-11T18:35:30Z In a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by...]]> In a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by...Image of a Softbank datacenter corridor.

In a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by thousands of cables humming with potential. Until last year, this sprawling AI factory didn��t exist. Now it��s poised to anchor SoftBank Corporation��s vision for AI-powered innovation, a vision rooted in creating a society that coexists��

Source

]]>
0
Michelle Horton <![CDATA[AI Advances Parkinson��s Detection Using Standard MRI Scans]]> http://www.open-lab.net/blog/?p=98636 2025-04-17T19:35:29Z 2025-04-11T16:58:59Z A simple brain scan may soon be all that's needed to accurately diagnose Parkinson��s disease, thanks to a new AI-powered tool. The advancement could help...]]> A simple brain scan may soon be all that's needed to accurately diagnose Parkinson��s disease, thanks to a new AI-powered tool. The advancement could help...

A simple brain scan may soon be all that��s needed to accurately diagnose Parkinson��s disease, thanks to a new AI-powered tool. The advancement could help doctors expedite detection and treatment, getting patients the care they need and improving their quality of life. Developed by teams from the University of Florida (UF) and top-tier medical centers, the machine learning model analyzes MRI��

Source

]]>
0
Graham Lopez <![CDATA[Just Released: NVIDIA HPC SDK v25.3]]> http://www.open-lab.net/blog/?p=98646 2025-04-17T19:35:30Z 2025-04-10T20:20:32Z The HPC SDK v25.3 release includes support for NVIDIA Blackwell GPUs and an optimized allocator for Arm CPUs.]]> The HPC SDK v25.3 release includes support for NVIDIA Blackwell GPUs and an optimized allocator for Arm CPUs.

The HPC SDK v25.3 release includes support for NVIDIA Blackwell GPUs and an optimized allocator for Arm CPUs.

Source

]]>
0
Shai Shen-Orr <![CDATA[Curating Biological Findings from Scientific Literature with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=98526 2025-04-28T23:18:36Z 2025-04-10T18:30:00Z Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...]]> Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological...

Scientific papers are highly heterogeneous, often employing diverse terminologies for the same entities, using varied methodologies to study biological phenomena, and presenting findings within distinct contexts. Extracting meaningful insights from these papers requires a profound understanding of biology, a critical evaluation of methodologies, and the ability to discern robust findings from��

Source

]]>
0
Prem Sagar Gali <![CDATA[Efficiently Scaling Polars GPU Parquet Reader]]> http://www.open-lab.net/blog/?p=98435 2025-04-22T23:52:25Z 2025-04-10T16:30:00Z When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...]]> When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...

When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for its speed and efficiency, offers a GPU-accelerated backend powered by cuDF that can significantly boost performance. However, to fully leverage the power of the Polars GPU backend, it��s essential to optimize the data loading process��

Source

]]>
0
Matheen Raza <![CDATA[Delivering NVIDIA Accelerated Computing for Enterprise AI Workloads with Rafay]]> http://www.open-lab.net/blog/?p=98533 2025-04-22T23:52:20Z 2025-04-09T20:09:43Z The worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment...]]> The worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment...

The worldwide adoption of generative AI has driven massive demand for accelerated compute hardware globally. In enterprises, this has accelerated the deployment of accelerated private cloud infrastructure. At the regional level, this demand for compute infrastructure has given rise to a new category of cloud providers who offer accelerated compute (GPU) capacity for AI workloads, also known as GPU��

Source

]]>
0
Ashish Sardana <![CDATA[Prevent LLM Hallucinations with the Cleanlab Trustworthy Language Model in NVIDIA NeMo Guardrails]]> http://www.open-lab.net/blog/?p=98456 2025-04-22T23:39:03Z 2025-04-09T20:00:00Z As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...]]> As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as...

As more enterprises integrate LLMs into their applications, they face a critical challenge: LLMs can generate plausible but incorrect responses, known as hallucinations. AI guardrails��or safeguarding mechanisms enforced in AI models and applications��are a popular technique to ensure the reliability of AI applications. This post demonstrates how to build safer��

Source

]]>
0
Tyler Whitehouse <![CDATA[Just Released: NVIDIA AI Workbench 2025.03.10]]> http://www.open-lab.net/blog/?p=98549 2025-04-17T19:35:34Z 2025-04-09T18:45:41Z NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.]]> NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.

NVIDIA AI Workbench 2025.03.10 features streamlined onboarding and enhanced UX for multicontainer projects.

Source

]]>
0
Christian Munley <![CDATA[Stanford Das Lab Accelerates RNA Folding Research with NVIDIA DGX Cloud]]> http://www.open-lab.net/blog/?p=96840 2025-04-17T19:35:35Z 2025-04-09T16:00:00Z The Das Lab at Stanford is revolutionizing RNA folding research with a unique approach that leverages community involvement and accelerated computing. With the...]]> The Das Lab at Stanford is revolutionizing RNA folding research with a unique approach that leverages community involvement and accelerated computing. With the...Decorative image of RNA against a nucleotide letter background.

The Das Lab at Stanford is revolutionizing RNA folding research with a unique approach that leverages community involvement and accelerated computing. With the support of NVIDIA DGX Cloud through the NAIRR Pilot program, the lab gained access to 32 NVIDIA A100 DGX Cloud nodes with eight GPUs each for three months, enabling the team to transition from small-scale experiments to large-scale��

Source

]]>
0
Chris Alexiuk <![CDATA[Build Enterprise AI Agents with Advanced Open NVIDIA Llama Nemotron Reasoning Models]]> http://www.open-lab.net/blog/?p=97155 2025-05-05T16:01:49Z 2025-04-08T22:05:00Z This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To...]]> This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To...

This updated post was originally published on March 18, 2025. Organizations are embracing AI agents to enhance productivity and streamline operations. To maximize their impact, these agents need strong reasoning abilities to navigate complex problems, uncover hidden connections, and make logical decisions autonomously in dynamic environments. Due to their ability to tackle complex��

Source

]]>
0
Elias Wolfberg <![CDATA[Using AI to Better Understand the Ocean]]> http://www.open-lab.net/blog/?p=98501 2025-04-17T19:35:37Z 2025-04-08T18:04:55Z Humans know more about deep space than we know about Earth��s deepest oceans. But scientists have plans to change that��with the help of AI.  ��We have...]]> Humans know more about deep space than we know about Earth��s deepest oceans. But scientists have plans to change that��with the help of AI.  ��We have...An image of a robot underwater.

Humans know more about deep space than we know about Earth��s deepest oceans. But scientists have plans to change that��with the help of AI. ��We have better maps of Mars than we do of our own exclusive economic zone,�� said Nick Rotker, chief BlueTech strategist at MITRE, a US government-sponsored nonprofit research organization. ��Around 70% of the Earth is covered in water and we��ve explored��

Source

]]>
0
Vinay Raman <![CDATA[Evaluating and Enhancing RAG Pipeline Performance Using Synthetic Data?]]> http://www.open-lab.net/blog/?p=97927 2025-05-15T06:26:42Z 2025-04-07T18:39:06Z As large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...]]> As large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal...Decorative image.

As large language models (LLM) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal point. RAG pipelines combine the generation power of LLMs with external data sources and retrieval mechanisms, enabling models to access domain-specific information that may not have existed during fine-tuning.

Source

]]>
0
Elias Wolfberg <![CDATA[Startups Use AI to Deliver Better Maternal and Newborn Care]]> http://www.open-lab.net/blog/?p=98486 2025-04-22T23:55:26Z 2025-04-07T17:55:39Z Nearly 300,000 women across the globe die each year due to complications arising from pregnancy or childbirth. The number of stillborns and babies that die...]]> Nearly 300,000 women across the globe die each year due to complications arising from pregnancy or childbirth. The number of stillborns and babies that die...An image of a women getting an ultrasound.

Nearly 300,000 women across the globe die each year due to complications arising from pregnancy or childbirth. The number of stillborns and babies that die within their first month tops nearly 4M every year. April 7 marks World Health Day, which this year focuses on raising awareness about efforts to end preventable maternal and newborn deaths. Giving women and infants better access to��

Source

]]>
0
Sama Bali <![CDATA[Event: HP & NVIDIA Developer Challenge]]> http://www.open-lab.net/blog/?p=98487 2025-04-17T19:35:39Z 2025-04-07T17:54:00Z Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.]]> Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.

Join the hackathon to build open-source AI solutions, optimize models, enhance workflows, connect with peers, and win prizes.

Source

]]>
0
Anu Srivastava <![CDATA[NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick]]> http://www.open-lab.net/blog/?p=98468 2025-04-22T23:57:03Z 2025-04-06T02:18:34Z The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...]]> The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...Decorative image of a llama in sunglasses standing on two feet, with a shadow that is flexing it's muscles.

The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can achieve over 40K output tokens per second on NVIDIA Blackwell B200 GPUs, and are available to try as NVIDIA NIM microservices. The Llama 4 models are now natively multimodal and multilingual using a mixture-of-experts (MoE) architecture.

Source

]]>
1
Matt Ahrens <![CDATA[Accelerating Apache Parquet Scans on Apache Spark with GPUs]]> http://www.open-lab.net/blog/?p=98350 2025-04-22T23:57:50Z 2025-04-03T16:18:03Z As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...]]> As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...Decorative image.

As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage format designed for efficient data processing at scale. By organizing data by columns rather than rows, Parquet enables high-performance querying and analysis, as it can read only the necessary columns for a query instead of scanning entire��

Source

]]>
3
Ashraf Eassa <![CDATA[NVIDIA Blackwell Delivers Massive Performance Leaps in MLPerf Inference v5.0]]> http://www.open-lab.net/blog/?p=98367 2025-04-23T19:41:12Z 2025-04-02T18:14:48Z The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...]]> The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency...

The compute demands for large language model (LLM) inference are growing rapidly, fueled by the combination of growing model sizes, real-time latency requirements, and, most recently, AI reasoning. At the same time, as AI adoption grows, the ability of an AI factory to serve as many users as possible, all while maintaining good per-user experiences, is key to maximizing the value it generates.

Source

]]>
0
Vinh Nguyen <![CDATA[LLM Inference Benchmarking: Fundamental Concepts]]> http://www.open-lab.net/blog/?p=98215 2025-05-09T18:23:04Z 2025-04-02T17:00:00Z This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM...]]> This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM...

This is the first post in the large language model latency-throughput benchmarking series, which aims to instruct developers on common metrics used for LLM benchmarking, fundamental concepts, and how to benchmark your LLM applications. The past few years have witnessed the rise in popularity of generative AI and large language models (LLMs), as part of a broad AI revolution.

Source

]]>
0
Elias Wolfberg <![CDATA[How AI Is Shaping Climate Innovation and Sustainable Growth]]> http://www.open-lab.net/blog/?p=98358 2025-04-17T19:35:43Z 2025-04-01T21:14:54Z At GTC 2025, a panel of industry leaders from across the tech ecosystem shared how they��re using AI to mitigate and prepare customers for the increasingly...]]> At GTC 2025, a panel of industry leaders from across the tech ecosystem shared how they��re using AI to mitigate and prepare customers for the increasingly...

At GTC 2025, a panel of industry leaders from across the tech ecosystem shared how they��re using AI to mitigate and prepare customers for the increasingly disruptive impact of climate change. Tenika Versey, the global head of sustainable futures for the NVIDIA Inception program, led a panel that included Colin le Duc, founding partner at Generation Investment Management, Suzanne DiBianca��

Source

]]>
0
Ronen Dar <![CDATA[NVIDIA Open Sources Run:ai Scheduler to Foster Community Collaboration]]> http://www.open-lab.net/blog/?p=98094 2025-04-22T23:59:16Z 2025-04-01T09:00:00Z Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....]]> Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license....

Today, NVIDIA announced the open-source release of the KAI Scheduler, a Kubernetes-native GPU scheduling solution, now available under the Apache 2.0 license. Originally developed within the Run:ai platform, KAI Scheduler is now available to the community while also continuing to be packaged and delivered as part of the NVIDIA Run:ai platform. This initiative underscores NVIDIA��s commitment to��

Source

]]>
0
Ameya Parab <![CDATA[Practical Tips for Preventing GPU Fragmentation for Volcano Scheduler]]> http://www.open-lab.net/blog/?p=98171 2025-04-03T18:44:56Z 2025-03-31T20:00:54Z At NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA...]]> At NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA...

At NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA DGX Cloud-provisioned Kubernetes cluster, we stepped in to deliver a solution that not only met but exceeded expectations. By combining advanced scheduling techniques with a deep understanding of distributed workloads��

Source

]]>
0
Ashley Goldstein <![CDATA[Simulating Robots in Industrial Facility Digital Twins]]> http://www.open-lab.net/blog/?p=98201 2025-04-23T00:00:10Z 2025-03-31T16:00:00Z Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...]]> Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...

Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that include mobile robots, humanoid assistants, intelligent cameras, and AI agents throughout factories and warehouses. To harness the full potential of these physical AI enabled systems, companies rely on digital twins of their facilities��

Source

]]>
0
Brad Smith <![CDATA[A New Era in Data Center Networking with NVIDIA Silicon Photonics-based Network Switching]]> http://www.open-lab.net/blog/?p=97917 2025-04-03T18:45:19Z 2025-03-27T16:00:00Z NVIDIA is breaking new ground by integrating silicon photonics directly with its NVIDIA Quantum and NVIDIA Spectrum switch ICs. At GTC 2025, we announced the...]]> NVIDIA is breaking new ground by integrating silicon photonics directly with its NVIDIA Quantum and NVIDIA Spectrum switch ICs. At GTC 2025, we announced the...

NVIDIA is breaking new ground by integrating silicon photonics directly with its NVIDIA Quantum and NVIDIA Spectrum switch ICs. At GTC 2025, we announced the world��s most advanced Silicon Photonics Switch systems, powered by cutting-edge 200G SerDes technology. This innovation, known as co-packaged silicon photonics, delivers significant benefits such as 3.5x lower power consumption��

Source

]]>
0
Asawaree Bhide <![CDATA[R2D2: Advancing Robot Mobility and Whole-Body Control with Novel Workflows and AI Foundation Models from NVIDIA Research]]> http://www.open-lab.net/blog/?p=98193 2025-05-07T22:55:57Z 2025-03-27T15:00:00Z Welcome to the first edition of the NVIDIA Robotics Research and Development Digest (R2D2). This technical blog series will give developers and researchers...]]> Welcome to the first edition of the NVIDIA Robotics Research and Development Digest (R2D2). This technical blog series will give developers and researchers...

Welcome to the first edition of the NVIDIA Robotics Research and Development Digest (R2D2). This technical blog series will give developers and researchers deeper insight and access to the latest physical AI and robotics research breakthroughs across various NVIDIA Research labs. Developing robust robots presents significant challenges, such as: We address these challenges through��

Source

]]>
0
Arun Raman <![CDATA[Deploying the NVIDIA AI Blueprint for Cost-Efficient LLM Routing]]> http://www.open-lab.net/blog/?p=98006 2025-04-23T00:01:08Z 2025-03-26T22:01:20Z Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown...]]> Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown...

Since the release of ChatGPT in November 2022, the capabilities of large language models (LLMs) have surged, and the number of available models has grown exponentially. With this expansion, LLMs now vary widely in cost, performance, and specialization. For example, straightforward tasks like text summarization can be efficiently handled by smaller, general-purpose models. In contrast��

Source

]]>
0
Brian Shi <![CDATA[Boosting Q&A Accuracy with GraphRAG Using PyG and Graph Databases]]> http://www.open-lab.net/blog/?p=97900 2025-04-03T18:46:06Z 2025-03-26T21:41:08Z Large language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to...]]> Large language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to...Decorative image.

Large language models (LLMs) often struggle with accuracy when handling domain-specific questions, especially those requiring multi-hop reasoning or access to proprietary data. While retrieval-augmented generation (RAG) can help, traditional vector search methods often fall short. In this tutorial, we show you how to implement GraphRAG in combination with fine-tuned GNN+LLM models to achieve��

Source

]]>
0
Cole Swain <![CDATA[Spotlight: Tomorrow.io?Transforms Global Weather Resilience with NVIDIA AI]]> http://www.open-lab.net/blog/?p=98023 2025-04-03T18:46:17Z 2025-03-26T21:19:34Z From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...]]> From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather...

From hyperlocal forecasts that guide daily operations to planet-scale models illuminating new climate insights, the world is entering a new frontier in weather and climate resilience. The combination of space-based observations and GPU-accelerated AI delivers near-instant, context-rich insights to enterprises, governments, researchers, and solution providers worldwide. It also marks a rare��

Source

]]>
0
Pomi Lee <![CDATA[Just Released: Omniverse Kit 107.0]]> http://www.open-lab.net/blog/?p=98210 2025-04-03T18:46:32Z 2025-03-26T19:25:39Z Kit SDK 107.0 is a major update release with primary updates for robotics development.]]> Kit SDK 107.0 is a major update release with primary updates for robotics development.

Kit SDK 107.0 is a major update release with primary updates for robotics development.

Source

]]>
0
John Ashcroft <![CDATA[Powering Flood Risk Assessment with NVIDIA Earth-2]]> http://www.open-lab.net/blog/?p=97974 2025-04-23T00:01:57Z 2025-03-25T20:59:12Z Inland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in...]]> Inland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in...

Inland flooding causes significant economic and societal impacts annually. Of the eight natural disasters costing the insurance industry over $1 billion in 2024, six of these were categorized as flood events, with three of these occurring in Europe alone. Catastrophe modeling aims to quantify the risk of flood events to enable preparedness for the financial and insurance industries.

Source

]]>
0
Pradyumna Desale <![CDATA[Automating AI Factories with NVIDIA Mission Control]]> http://www.open-lab.net/blog/?p=98012 2025-04-03T18:47:00Z 2025-03-25T18:45:11Z Advanced AI models such as DeepSeek-R1 are proving that enterprises can now build cutting-edge AI models specialized with their own data and expertise. These...]]> Advanced AI models such as DeepSeek-R1 are proving that enterprises can now build cutting-edge AI models specialized with their own data and expertise. These...

Advanced AI models such as DeepSeek-R1 are proving that enterprises can now build cutting-edge AI models specialized with their own data and expertise. These models can be tailored to unique use cases, tackling diverse challenges like never before. Based on the success of early AI adopters, many organizations are shifting their focus to full-scale production AI factories. Yet the process of��

Source

]]>
0
Xavier Renard <![CDATA[Spotlight: AXA Explores AI-Driven Hurricane Risk Assessment]]> http://www.open-lab.net/blog/?p=98096 2025-04-23T00:05:46Z 2025-03-25T17:47:06Z Large ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of...]]> Large ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of...

Large ensembles are essential for predicting rare, high-impact events that cannot be fully understood through historical data alone. By simulating thousands of potential scenarios, they provide the statistical depth necessary to assess risks, prepare for extremes, and build resilience against once-in-a-century disasters. Global insurance group AXA is conducting simulations with cutting-edge��

Source

]]>
0
���˳���97caoporen����