Large language models (LLMs) have created unprecedented opportunities across various industries. However, moving LLMs from research and development into reliable, scalable, and maintainable production systems presents unique operational challenges. LLMOps, or large language model operations, is designed to address these challenges. Building upon the principles of traditional machine…
Imagine you're leading security for a large enterprise and your teams are eager to leverage AI for more and more projects. There's a problem, though. As with any project, you must balance the promise and returns of innovation with the hard realities of compliance, risk management, and security posture mandates. Security leaders face a crucial challenge when evaluating AI models such as those…
Note: This blog post was originally published on Oct. 28, 2024, but has been edited to reflect new updates. Fraud in financial services is a massive problem. Financial losses from worldwide credit card transaction fraud are expected to total $403.88 billion over the next 10 years, according to research firm the Nilson Report. While other types of fraud, such as identity theft and account takeover…
AI is becoming the cornerstone of innovation across industries, driving new levels of creativity and productivity and fundamentally reshaping how we live and work. And it's enabled by a new type of infrastructure, the AI factory, that manufactures intelligence at scale and creates the foundation for what many consider the next industrial revolution. AI factories represent a reset of traditional…
The integration of NVIDIA NIM microservices into Azure AI Foundry marks a major leap forward in enterprise AI development. By combining NIM microservices with Azure's scalable, secure infrastructure, organizations can now deploy powerful, ready-to-use AI models more efficiently than ever before. NIM microservices are containerized for GPU-accelerated inferencing for pretrained and customized…
The worldwide adoption of generative AI has driven massive global demand for accelerated compute hardware. In enterprises, this has sped the deployment of GPU-accelerated private cloud infrastructure. At the regional level, this demand for compute infrastructure has given rise to a new category of cloud providers who offer accelerated compute (GPU) capacity for AI workloads, also known as GPU…
Advanced AI models such as DeepSeek-R1 are proving that enterprises can now build cutting-edge AI models specialized with their own data and expertise. These models can be tailored to unique use cases, tackling diverse challenges like never before. Based on the success of early AI adopters, many organizations are shifting their focus to full-scale production AI factories. Yet the process of…
NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning models in large-scale distributed environments. The framework boosts the number of requests served by up to 30x when running the open-source DeepSeek-R1 models on NVIDIA Blackwell.
People are increasingly probing generative AI technologies, such as large language models (LLMs), with adversarial inputs to see whether the outputs can be made to deviate from acceptable standards. This use of LLMs began in 2023 and has rapidly evolved into a common industry practice and a cornerstone of trustworthy AI. How can we standardize and define LLM red teaming?
NVIDIA AI Enterprise is the cloud-native software platform for the development and deployment of production-grade AI solutions. The latest release of the NVIDIA AI Enterprise infrastructure software collection adds support for the latest NVIDIA data center GPU, NVIDIA H200 NVL, giving your enterprise new options for powering cutting-edge use cases such as agentic and generative AI with some of the…
NVIDIA AI Workbench is a free development environment manager for developing, customizing, and prototyping AI applications on your GPUs. AI Workbench provides a frictionless experience across PCs, workstations, servers, and the cloud for AI, data science, and machine learning (ML) projects. This post provides details about the January 2025 release of NVIDIA AI Workbench…
In recent years, large language models (LLMs) have achieved extraordinary progress in areas such as reasoning, code generation, machine translation, and summarization. However, despite their advanced capabilities, foundation models have limitations in domain-specific areas such as finance and healthcare, and in capturing cultural and language nuances beyond English.
AI development has become a core part of modern software engineering, and NVIDIA is committed to finding ways to bring optimized accelerated computing to every developer who wants to start experimenting with AI. To address this, we've been working to make the accelerated computing stack more accessible with NVIDIA Launchables: preconfigured GPU computing environments that enable you to…
This white paper details our commitment to securing the NVIDIA AI Enterprise software stack. It outlines the processes and measures NVIDIA takes to ensure container security.
Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation, and data analysis. These AI-powered tools have improved how companies operate, from streamlining customer service to enhancing decision-making processes. However, despite their impressive general knowledge, LLMs often struggle with…
As global electricity demand continues to rise, traditional sources of energy are increasingly unsustainable. Energy providers are facing pressure to reduce reliance on fossil fuels while ensuring a fully supplied and stable grid. In this context, solar energy has emerged as a vital renewable resource, being one of the most abundant clean energy sources available. However…
Last month at the Supercomputing 2024 conference, NVIDIA announced the availability of NVIDIA H200 NVL, the latest NVIDIA Hopper platform. Optimized for enterprise workloads, NVIDIA H200 NVL is a versatile platform that delivers accelerated performance for a wide range of AI and HPC applications. With its dual-slot PCIe form factor and 600W TGP, the H200 NVL enables flexible configuration options…
Generative AI is transforming every aspect of the automotive industry, including software development, testing, user experience, personalization, and safety. With the automotive industry shifting from a mechanically driven approach to a software-driven one, generative AI is unlocking a world of possibilities. Tata Consultancy Services (TCS) focuses on two major segments for leveraging…
AI agents powered by large language models (LLMs) help organizations streamline and reduce manual workloads. These agents use multilevel, iterative reasoning to analyze problems, devise solutions, and execute tasks with various tools. Unlike traditional chatbots, LLM-powered agents automate complex tasks by effectively understanding and processing information. To avoid potential risks in specific…
The exponential growth of visual data, ranging from images to PDFs to streaming videos, has made manual review and analysis virtually impossible. Organizations are struggling to transform this data into actionable insights at scale, leading to missed opportunities and increased risks. To solve this challenge, vision-language models (VLMs) are emerging as powerful tools…
AI agents are emerging as the newest way for organizations to increase efficiency, improve productivity, and accelerate innovation. These agents are more advanced than prior AI applications, with the ability to autonomously reason through tasks, call out to other tools, and incorporate both enterprise data and employee knowledge to produce valuable business outcomes. They're being embedded into…
In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional infrastructure can often struggle to meet the demands of modern AI workloads, leading to bottlenecks in development and deployment processes. As organizations strive to deploy AI models and data-intensive applications at scale…
As enterprises increasingly adopt AI technologies, they face the complex challenge of efficiently developing, securing, and continuously improving AI applications to leverage their data assets. They need a unified, end-to-end solution that simplifies AI development, enhances security, and enables continuous optimization, allowing organizations to harness the full potential of their data for AI…
Inferencing for generative AI and AI agents will drive the need for AI compute infrastructure to be distributed from edge to central clouds. IDC predicts that "Business AI (consumer excluded) will contribute $19.9 trillion to the global economy and account for 3.5% of GDP by 2030." 5G networks must also evolve to serve this new incoming AI traffic. At the same time, there is an opportunity…
In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such innovation is retrieval-augmented generation (RAG), which is transforming how medical information is processed and used. RAG combines the capabilities of large language models (LLMs) with external knowledge retrieval…
Providing customers with quality service remains a top priority for businesses across industries, from answering questions and troubleshooting issues to facilitating online orders. As businesses scale operations and expand offerings globally to compete, the demand for seamless customer service grows exponentially. Searching knowledge base articles or navigating complex phone trees can be a…
Expanding the open-source Meta Llama collection of models, the Llama 3.2 collection includes vision language models (VLMs), small language models (SLMs), and an updated Llama Guard model with support for vision. When paired with the NVIDIA accelerated computing platform, Llama 3.2 offers developers, researchers, and enterprises valuable new capabilities and optimizations to realize their…
Global energy technology company SLB has announced the next milestone in its long-standing collaboration with NVIDIA to develop and scale generative AI solutions for the energy industry. The collaboration accelerates the development and deployment of energy industry-specific generative AI foundation models across SLB global platforms, including its Delfi digital platform and SLB's new Lumi…
Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time data, perform actions, and interact with external systems, developers can bridge the gap to new, real-world use cases that significantly enhance productivity and the user experience. xpander AI, a member of the NVIDIA Inception program for…
The Llama 3.1 405B large language model (LLM), developed by Meta, is an open-source community model that delivers state-of-the-art performance and supports a variety of use cases. With 405 billion parameters and support for context lengths of up to 128K tokens, Llama 3.1 405B is also one of the most demanding LLMs to run. To deliver both low latency to optimize the user experience and high…
As large language models (LLMs) continue to evolve at an unprecedented pace, enterprises are looking to build generative AI-powered applications that maximize throughput to lower operational costs and minimize latency to deliver superior user experiences. This post discusses the critical performance metrics of throughput and latency for LLMs, exploring their importance and trade-offs between…
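The throughput and latency metrics described above can be made concrete with a small sketch. Everything here is illustrative: `generate_fn` stands in for whatever client call your serving stack exposes, and the toy `fake_generate` lambda exists only so the snippet runs on its own.

```python
import time

def measure_request(generate_fn, prompt):
    """Time a single generation call and derive basic serving metrics.

    `generate_fn` is any function that returns a list of output tokens
    for a prompt (hypothetical; substitute your own client call).
    """
    start = time.perf_counter()
    tokens = generate_fn(prompt)
    latency_s = time.perf_counter() - start
    # Throughput: output tokens produced per second of wall-clock time.
    throughput = len(tokens) / latency_s if latency_s > 0 else 0.0
    return {"latency_s": latency_s, "throughput_tok_per_s": throughput}

# Toy stand-in for an LLM client, for illustration only.
fake_generate = lambda prompt: prompt.split() * 4

metrics = measure_request(fake_generate, "hello world from the server")
print(metrics)
```

In a real deployment these two numbers pull against each other: larger batch sizes raise aggregate throughput but add queueing delay to each request's latency.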
Meta's Llama collection of large language models is the most popular family of foundation models in the open-source community today, supporting a variety of use cases. Millions of developers worldwide are building derivative models and integrating them into their applications. With Llama 3.1, Meta is launching a suite of large language models (LLMs) as well as a suite of trust and safety models…
The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic data. Generative AI is revolutionizing multiple industries by enabling rapid creation of content, powering intelligent knowledge assistants, augmenting software development with coding co-pilots, and automating complex tasks across various…
The latest embedding model from NVIDIA, NV-Embed, set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark (MTEB), which covers 56 embedding tasks. Highly accurate and effective models like NV-Embed are key to transforming vast amounts of data into actionable insights. NVIDIA provides top-performing models through the NVIDIA API catalog.
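Embedding models like NV-Embed are typically used by comparing the vectors they emit with cosine similarity. A minimal, dependency-free sketch follows; the 4-dimensional vectors are toy values, whereas real embedding models produce vectors with thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy 4-dimensional "embeddings" for a query and a candidate document.
query_vec = [0.1, 0.9, 0.0, 0.2]
doc_vec = [0.2, 0.8, 0.1, 0.1]
print(round(cosine_similarity(query_vec, doc_vec), 3))
```

Ranking documents by this score against a query embedding is the core operation behind the retrieval tasks that benchmarks like MTEB measure.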
Manufacturers face increased pressures to shorten production cycles, enhance productivity, and improve quality, all while reducing costs. To address these challenges, they're investing in industrial digitalization and AI-enabled digital twins to unlock new possibilities from planning to operations. Developers at Pegatron, an electronics manufacturer based in Taiwan, used NVIDIA AI…
NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models for Windows applications. It's free to use, doesn't require prior experience with AI frameworks and development tools, and delivers the best AI performance for both local and cloud deployments. The wide availability of generative…
NVIDIA AI Workbench, a toolkit for AI and ML developers, is now generally available as a free download. It features automation that removes roadblocks for novice developers and makes experts more productive. Developers can experience a fast and reliable GPU environment setup and the freedom to work, manage, and collaborate across heterogeneous platforms regardless of skill level.
NVIDIA SDKs have been instrumental in accelerating AI applications across a spectrum of use cases spanning smart cities, medical, and robotics. However, achieving a production-grade AI solution that can be deployed at the edge to support human and machine collaboration safely and securely requires both high-quality hardware and software tailored for enterprise needs. NVIDIA is again accelerating…
The rise in generative AI adoption has been remarkable. Catalyzed by the launch of OpenAI's ChatGPT in 2022, the new technology amassed over 100M users within months and drove a surge of development activity across almost every industry. By 2023, developers had begun proofs of concept (POCs) using APIs and open-source community models from Meta, Mistral, Stability, and more. Entering 2024…
Generative AI has the potential to transform every industry. Human workers are already using large language models (LLMs) to explain, reason about, and solve difficult cognitive tasks. Retrieval-augmented generation (RAG) connects LLMs to data, expanding the usefulness of LLMs by giving them access to up-to-date and accurate information. Many enterprises have already started to explore how…
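The RAG pattern described above, retrieving relevant context and handing it to the LLM alongside the question, can be illustrated without any particular framework. The word-overlap ranking and prompt format below are simplifications for illustration, not any library's actual API; production systems use embedding-based retrieval instead.

```python
def retrieve(query, passages, k=2):
    """Rank passages by naive word overlap with the query; keep top k."""
    q_words = set(query.lower().split())
    scored = sorted(
        passages,
        key=lambda p: len(q_words & set(p.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query, passages):
    """Assemble an augmented prompt: retrieved context plus the question."""
    context = "\n".join(f"- {p}" for p in retrieve(query, passages))
    return f"Use only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical knowledge-base snippets standing in for enterprise data.
kb = [
    "The warranty period for model X is 24 months.",
    "Shipping within the EU takes 3-5 business days.",
    "Returns are accepted within 30 days of delivery.",
]
print(build_prompt("How long is the warranty for model X?", kb))
```

The augmented prompt would then be sent to an LLM, which grounds its answer in the retrieved passages rather than in stale training data.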
As of March 18, 2025, NVIDIA Triton Inference Server is now part of the NVIDIA Dynamo Platform and has been renamed to NVIDIA Dynamo Triton, accordingly. Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by iteratively shaping random noise into AI-generated art through denoising diffusion…
This week's model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation Models and Endpoints are a curated set of community and NVIDIA-built generative AI models to experience, customize, and deploy in enterprise applications. Try leading models such as Nemotron-3, Mixtral 8x7B, Gemma 7B…
This week's model release features the NVIDIA-optimized language model Phi-2, which can be used for a wide range of natural language processing (NLP) tasks. You can experience Phi-2 directly from your browser. NVIDIA AI Foundation Models and Endpoints are a curated set of community and NVIDIA-built generative AI models to experience, customize, and deploy in enterprise applications.
This week's Model Monday release features the NVIDIA-optimized Code Llama, Kosmos-2, and SeamlessM4T models, which you can experience directly from your browser. With NVIDIA AI Foundation Models and Endpoints, you can access a curated set of community and NVIDIA-built generative AI models to experience, customize, and deploy in enterprise applications. Meta's Code Llama 70B is the latest…
While harnessing the potential of AI is a priority for many of today's enterprises, developing and deploying an AI model involves time and effort. Often, challenges must be overcome to move a model into production, especially for mission-critical business operations. According to IDC research, only 18% of enterprises surveyed could put an AI model into production in under a month.
Following the introduction of ChatGPT, enterprises around the globe are realizing the benefits and capabilities of AI, and are racing to adopt it into their workflows. As this adoption accelerates, it becomes imperative for enterprises not only to keep pace with the rapid advancements in AI, but also to address related challenges such as optimization, scalability, and security.
Convai is a versatile developer platform for designing characters with advanced multimodal perception abilities. These characters are designed to integrate seamlessly into both the virtual and real worlds. Whether you're a creator, game designer, or developer, Convai enables you to quickly modify a non-playable character (NPC), from backstory and knowledge to voice and personality.
There are many ways to deploy ML models to production. Sometimes, a model is run once per day to refresh forecasts in a database. Sometimes, it powers a small-scale but critical decision-making dashboard or speech-to-text on a mobile device. These days, the model can also be a custom large language model (LLM) backing a novel AI-driven product experience. Often, the model is exposed to its…
Businesses rely more than ever on data and AI to innovate, offer value to customers, and stay competitive. The adoption of machine learning (ML) created a need for tools, processes, and organizational principles to manage code, data, and models that work reliably, cost-effectively, and at scale. This is broadly known as machine learning operations (MLOps). The world is venturing rapidly into…
Crossing the chasm and reaching its iPhone moment, generative AI must scale to fulfill exponentially increasing demands. Reliability and uptime are critical for building generative AI at the enterprise level, especially when AI is core to conducting business operations. NVIDIA is investing its expertise into building a solution for those enterprises ready to take the leap.
AI is transforming industries, automating processes, and opening new opportunities for innovation in the rapidly evolving technological landscape. As more businesses recognize the value of incorporating AI into their operations, they face the challenge of implementing these technologies efficiently, effectively, and reliably. Enter NVIDIA AI Enterprise, a comprehensive software suite…
AI is the topic of conversation around the world in 2023. It is rapidly being adopted by all industries, including media, entertainment, and broadcasting. To be successful in 2023 and beyond, companies and agencies must embrace and deploy AI more rapidly than ever before. The capabilities of new AI programs like video analytics, ChatGPT, recommenders, speech recognition, and customer service are…
At COMPUTEX 2023, NVIDIA announced the NVIDIA DGX GH200, which marks another breakthrough in GPU-accelerated computing to power the most demanding giant AI workloads. In addition to describing critical aspects of the NVIDIA DGX GH200 architecture, this post discusses how NVIDIA Base Command enables rapid deployment, accelerates the onboarding of users, and simplifies system management.
The incredible advances of accelerated computing are powered by data. The role of data in accelerating AI workloads is crucial for businesses looking to stay ahead of the curve in the current fast-paced digital environment. Speeding up access to that data is yet another way that NVIDIA accelerates entire AI workflows. NVIDIA DGX Cloud caters to a wide variety of market use cases.
The Dataiku platform for everyday AI simplifies deep learning. Use cases are far-reaching, from image classification to object detection and natural language processing (NLP). Dataiku helps you with labeling, model training, explainability, model deployment, and centralized management of code and code environments. This post dives into high-level Dataiku and NVIDIA integrations for image…
The business applications of GPU-accelerated computing are set to expand greatly in the coming years. One of the fastest-growing trends is the use of generative AI for creating human-like text and all types of images. Driving the explosion of market interest in generative AI are technologies such as transformer models that bring AI to everyday applications, from conversational text to…
Generative AI has marked an important milestone in the AI revolution journey. We are at a fundamental inflection point where enterprises are not only getting their feet wet but jumping into the deep end. With over 50 frameworks, pretrained models, and development tools, NVIDIA AI Enterprise, the software layer of the NVIDIA AI platform, is designed to accelerate enterprises to the leading edge…
AI is impacting every industry, from improving customer service and streamlining supply chains to accelerating cancer research. As enterprises invest in AI to stay ahead of the competition, they often struggle with finding the strategy and infrastructure for success. Many AI projects are rapidly evolving, which makes production at scale especially challenging. We believe in developing…
Discover how to build a robust MLOps practice for continuous delivery and automated deployment of AI workloads at scale.
Edge computing and edge AI are powering the digital transformation of business processes. But because the field is still growing, many questions remain about what exactly needs to be in an edge management platform. The benefits of edge computing include low latency for real-time responses, using local area networks for higher bandwidth, and storage at lower costs compared to cloud computing.
Two years ago, NVIDIA and VMware announced that they would reimagine and re-architect the data center. Hundreds of engineers across both companies have worked closely to bring this joint solution to fruition. NVIDIA announces the availability of VMware vSphere on the NVIDIA BlueField DPU, providing the ideal solution for delivering a software-defined…
AI has the power to transform every industry, but transformation takes time, and it's rarely easy. For enterprises across industries to be as successful as possible in their own transformations, they need access to AI-ready technology platforms. They also must be able to use 5G connectivity at the edge to harness valuable data and inform their AI and ML models. Sign up for the latest…
Today, NVIDIA announced general availability of NVIDIA AI Enterprise 2.1. This latest version of the end-to-end AI and data analytics software suite is optimized, certified, and supported for enterprises to deploy and scale AI applications across bare metal, virtual, container, and cloud environments. The NVIDIA AI Enterprise 2.1 release offers advanced data science with the latest…
Cybersecurity software is getting more sophisticated these days, thanks to AI and ML capabilities. It's now possible to automate security measures without direct human intervention. The value in these powerful solutions is real: stopping breaches, providing highly detailed alerts, and protecting attack surfaces. Still, it pays to be a skeptic. This interview with NVIDIA experts Bartley…
Deep learning has come to mean the most common implementation of a neural network for performing many AI tasks. Data scientists use software frameworks such as TensorFlow and PyTorch to develop and run DL algorithms. By this point, there has been a lot written about deep learning, and you can find more detailed information from many sources. For a good high-level summary, see What's the…
There is an ongoing demand for servers with the ability to transfer data from the network to a GPU at ever faster speeds. As AI models keep getting bigger, the sheer volume of data needed for training requires techniques such as multinode training to achieve results in a reasonable timeframe. Signal processing for 5G is more sophisticated than previous generations, and GPUs can help increase the…
The new year has been off to a great start with NVIDIA AI Enterprise 1.1 providing production support for container orchestration and Kubernetes cluster management using VMware vSphere with Tanzu 7.0 update 3c, delivering AI/ML workloads to every business in VMs, containers, or Kubernetes. New NVIDIA AI Enterprise labs for IT admins and MLOps are available on NVIDIA LaunchPad…
Like so many schools and universities during the COVID-19 pandemic, the University of Pisa, one of Italy's oldest universities, was challenged to find solutions for conducting research while many students had to attend classes remotely. The university's Research Computing department has a focused study in AI deep learning and machine learning applications, where they normally perform…
NVIDIA AI Enterprise is a suite of AI software, certified to run on VMware vSphere 7 Update 2 with NVIDIA-Certified volume servers. It includes key enabling technologies and software from NVIDIA for rapid deployment, management, and scaling of AI workloads in the virtualized data center running on VMware vSphere. The NVIDIA AI Enterprise suite also enables IT Administrators, Data Scientists…