DGX Cloud – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2025-07-10T19:19:07Z http://www.open-lab.net/blog/feed/

Jason Perlow, "How Early Access to NVIDIA GB200 Systems Helped LMArena Build a Model to Evaluate LLMs" http://www.open-lab.net/blog/?p=102053 2025-06-26T18:55:16Z 2025-06-18T16:00:00Z

LMArena at the University of California, Berkeley is making it easier to see which large language models excel at specific tasks, thanks to help from NVIDIA and Nebius. Its rankings, powered by the Prompt-to-Leaderboard (P2L) model, collect votes from humans on which AI performs best in areas such as math, coding, or creative writing. "We capture user preferences across tasks and apply...

Janisha Anand, "Introducing NVIDIA DGX Cloud Lepton: A Unified AI Platform Built for Developers" http://www.open-lab.net/blog/?p=101586 2025-06-12T18:48:32Z 2025-06-11T11:00:00Z

The age of AI-native applications has arrived. Developers are building advanced agentic and physical AI systems, but scaling across geographies and GPU providers remains a challenge. NVIDIA built DGX Cloud Lepton to help. It's a unified AI platform and compute marketplace that connects developers to tens of thousands of GPUs from a global network of cloud providers. And it's now available for...

Abhishek Sinha, "Announcing NVIDIA Exemplar Clouds for Benchmarking AI Cloud Infrastructure" http://www.open-lab.net/blog/?p=100157 2025-05-29T17:30:58Z 2025-05-19T06:00:00Z

Developers and enterprises training large language models (LLMs) and deploying AI workloads in the cloud have long faced a fundamental challenge: it's nearly impossible to know in advance if a cloud platform will deliver the performance, reliability, and cost efficiency their applications require. In this context, the difference between theoretical peak performance and actual...

Rucha Apte, "Applying Specialized LLMs with Reasoning Capabilities to Accelerate Battery Research" http://www.open-lab.net/blog/?p=99794 2025-05-29T19:05:10Z 2025-05-09T16:00:00Z

Scientific research in complex fields like battery innovation is often slowed by manual evaluation of materials, limiting progress to just dozens of candidates per day. In this blog post, we explore how domain-adapted large language models (LLMs), enhanced with reasoning capabilities, are transforming scientific research, especially in high-stakes, complex domains like battery innovation.

Camden Spehl, "Concept-Driven AI Teaching Assistant Guides Students to Deeper Insights" http://www.open-lab.net/blog/?p=99719 2025-05-29T19:05:17Z 2025-05-07T20:57:51Z

In today's educational landscape, generative AI tools have become both a blessing and a challenge. While these tools offer unprecedented access to information, they've also created new concerns about academic integrity. Increasingly, students rely on AI to generate direct answers to homework questions, often at the expense of developing critical thinking skills and mastering core concepts.

Weiji Chen, "New NVIDIA NV-Tesseract Time Series Models Advance Dataset Processing and Anomaly Detection" http://www.open-lab.net/blog/?p=99642 2025-05-29T19:05:21Z 2025-05-06T16:22:57Z

Time-series data has evolved from a simple historical record into a real-time engine for critical decisions across industries. Whether it's streamlining logistics, forecasting markets, or anticipating machine failures, organizations need more sophisticated tools than traditional methods can offer. NVIDIA GPU-accelerated deep learning is enabling industries to gain real-time analytics.

Brian Sparks, "NVIDIA Helps Build AI Factories Faster Than Ever with NVIDIA DGX SuperPOD" http://www.open-lab.net/blog/?p=98579 2025-04-17T19:35:28Z 2025-04-11T18:35:30Z

In a cavernous room at an undisclosed location in Japan, a digital revolution is unfolding. Racks of servers stand like giants, their sleek frames linked by thousands of cables humming with potential. Until last year, this sprawling AI factory didn't exist. Now it's poised to anchor SoftBank Corporation's vision for AI-powered innovation, a vision rooted in creating a society that coexists...

Elias Wolfberg, "Using AI to Better Understand the Ocean" http://www.open-lab.net/blog/?p=98501 2025-04-17T19:35:37Z 2025-04-08T18:04:55Z

Humans know more about deep space than we know about Earth's deepest oceans. But scientists have plans to change that with the help of AI. "We have better maps of Mars than we do of our own exclusive economic zone," said Nick Rotker, chief BlueTech strategist at MITRE, a US government-sponsored nonprofit research organization. "Around 70% of the Earth is covered in water and we've explored...

Ameya Parab, "Practical Tips for Preventing GPU Fragmentation for Volcano Scheduler" http://www.open-lab.net/blog/?p=98171 2025-04-03T18:44:56Z 2025-03-31T20:00:54Z

At NVIDIA, we take pride in tackling complex infrastructure challenges with precision and innovation. When Volcano faced GPU underutilization in their NVIDIA DGX Cloud-provisioned Kubernetes cluster, we stepped in to deliver a solution that not only met but exceeded expectations. By combining advanced scheduling techniques with a deep understanding of distributed workloads...

Wen Jie Ong, "Accelerating the Future of Transportation with SES AI's NVIDIA-Powered Innovation for Electric Vehicles" http://www.open-lab.net/blog/?p=97805 2025-04-23T00:04:13Z 2025-03-25T16:00:00Z

Electric vehicles (EVs) are transforming transportation, but challenges such as cost, longevity, and range remain barriers to widespread adoption. At the heart of these challenges lies battery technology: specifically, the electrolyte, a critical component that enables energy storage and delivery. The electrolyte's properties directly impact a battery's charging speed, power output, stability...

Vishal Ganeriwala, "Seamlessly Scale AI Across Cloud Environments with NVIDIA DGX Cloud Serverless Inference" http://www.open-lab.net/blog/?p=97192 2025-03-20T17:07:54Z 2025-03-18T21:22:51Z

NVIDIA DGX Cloud Serverless Inference is an auto-scaling AI inference solution that enables application deployment with speed and reliability. Powered by NVIDIA Cloud Functions (NVCF), DGX Cloud Serverless Inference abstracts multi-cluster infrastructure setups across multi-cloud and on-premises environments for GPU-accelerated workloads. Whether managing AI workloads...

Emily Potyraj, "Measure and Improve AI Workload Performance with NVIDIA DGX Cloud Benchmarking" http://www.open-lab.net/blog/?p=97548 2025-05-06T17:00:29Z 2025-03-18T21:21:17Z

As AI capabilities advance, understanding the impact of hardware and software infrastructure choices on workload performance is crucial for both technical validation and business planning. Organizations need a better way to assess real-world, end-to-end AI workload performance and the total cost of ownership rather than just comparing raw FLOPs or hourly cost per GPU.

Hao Wang, "Petabyte-Scale Video Processing with NVIDIA NeMo Curator on NVIDIA DGX Cloud" http://www.open-lab.net/blog/?p=97031 2025-03-20T17:07:03Z 2025-03-18T19:22:51Z

With the rise of physical AI, video content generation has surged exponentially. A single camera-equipped autonomous vehicle can generate more than 1 TB of video daily, while a robotics-powered manufacturing facility may produce 1 PB of data daily. To leverage this data for training and fine-tuning world foundation models (WFMs), you must first process it efficiently.

Emily Potyraj, "NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance" http://www.open-lab.net/blog/?p=95558 2025-05-06T17:01:29Z 2025-02-11T17:00:00Z

In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a comprehensive evaluation of the entire stack, from compute to networking to model framework. Navigating the complexities of AI system performance can be difficult. There are many application changes that you can make...

Martin Cimmino, "Continued Pretraining of State-of-the-Art LLMs for Sovereign AI and Regulated Industries with Domyn and NVIDIA DGX Cloud" http://www.open-lab.net/blog/?p=95012 2025-06-25T17:51:57Z 2025-01-16T12:00:00Z

In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and summarization. However, despite their advanced capabilities, foundation models have limitations when it comes to domain-specific expertise such as finance or healthcare or capturing cultural and language nuances beyond English.

Brad Nemire, "NVIDIA Project DIGITS, A Grace Blackwell AI Supercomputer On Your Desk" http://www.open-lab.net/blog/?p=94765 2025-01-23T19:54:30Z 2025-01-09T18:19:00Z

Powered by the new GB10 Grace Blackwell Superchip, Project DIGITS can tackle large generative AI models of up to 200B parameters.

Niels Bantilan, "Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud" http://www.open-lab.net/blog/?p=81110 2024-05-08T17:57:05Z 2024-04-24T01:12:42Z

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of the critical components of the AI revolution. We now rely on these workhorses to fulfill deep learning workloads, crunching through massive and complex semi-structured datasets. However, as demand for AI-based solutions has...

Mehran Maghoumi, "Scale and Curate High-Quality Datasets for LLM Training with NVIDIA NeMo Curator" http://www.open-lab.net/blog/?p=80168 2025-02-17T05:28:15Z 2024-03-27T18:00:00Z

Enterprises are using large language models (LLMs) as powerful tools to improve operational efficiency and drive innovation. NVIDIA NeMo microservices aim to make building and deploying models more accessible to enterprises. An important step for building any LLM system is to curate the dataset of tokens to be used for training or customizing the model. However, curating a suitable dataset...

Ike Nnoli, "Generative AI for Digital Human Technologies and New AI-powered NVIDIA RTX Lighting" http://www.open-lab.net/blog/?p=79707 2024-12-09T16:51:28Z 2024-03-19T17:00:00Z

At GDC 2024, NVIDIA announced that leading AI application developers such as Inworld AI are using NVIDIA digital human technologies to accelerate the deployment of generative AI-powered game characters alongside updated NVIDIA RTX SDKs that simplify the creation of beautiful worlds. You can incorporate the full suite of NVIDIA digital human technologies or individual microservices into...

Amanda Saunders, "NVIDIA NIM Offers Optimized Inference Microservices for Deploying AI Models at Scale" http://www.open-lab.net/blog/?p=79467 2024-06-03T15:44:17Z 2024-03-18T22:00:00Z

The rise in generative AI adoption has been remarkable. Catalyzed by the launch of OpenAI's ChatGPT in 2022, the new technology amassed over 100M users within months and drove a surge of development activities across almost every industry. By 2023, developers began POCs using APIs and open-source community models from Meta, Mistral, Stability, and more. Entering 2024...

Alan Nafiiev, "Accelerating Drug Discovery at Receptor.AI with NVIDIA BioNeMo Cloud APIs" http://www.open-lab.net/blog/?p=77569 2024-05-08T17:57:29Z 2024-02-14T21:00:00Z

The quest for new, effective treatments for diseases that remain stubbornly resistant to current therapies is at the heart of drug discovery. This traditionally long and expensive process has been radically improved by AI techniques like deep learning, empowered by the rise of accelerated computing. Receptor.AI, a London-based drug discovery company and NVIDIA Inception member...

Tanya Lenz, "Webinar: Accelerate AV Development with NVIDIA DGX Cloud and NVIDIA AI Enterprise" http://www.open-lab.net/blog/?p=72286 2024-05-08T17:57:52Z 2023-10-30T20:00:00Z

Learn how to leverage NVIDIA AI-powered infrastructure and software to accelerate AV development for maximum efficiency.

Joe Handzik, "High-Performance Storage on NVIDIA DGX Cloud with Oracle Cloud Infrastructure" http://www.open-lab.net/blog/?p=63551 2024-05-08T17:58:47Z 2023-04-18T18:43:47Z

The incredible advances of accelerated computing are powered by data. The role of data in accelerating AI workloads is crucial for businesses looking to stay ahead of the curve in the current fast-paced digital environment. Speeding up access to that data is yet another way that NVIDIA accelerates entire AI workflows. NVIDIA DGX Cloud caters to a wide variety of market use cases.
