The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the Grace CPU is designed to handle demanding workloads while consuming less power. NVIDIA believes in the benefit of leveraging GPUs to accelerate every workload. However, not all workloads are accelerated. This is especially true for those…
The NVIDIA Grace CPU is the first data center CPU developed by NVIDIA. Combining NVIDIA expertise with Arm processors, on-chip fabrics, system-on-chip (SoC) design, and resilient high-bandwidth, low-power memory technologies, the Grace CPU was built from the ground up to create the world's first superchip for computing. At the heart of the superchip lies the NVLink Chip-2-Chip (C2C) interconnect.
The NVIDIA Arm HPC Developer Kit is an integrated hardware and software platform for creating, evaluating, and benchmarking HPC, AI, and scientific computing applications on a heterogeneous GPU- and CPU-accelerated computing system. NVIDIA announced its availability in March of 2021. The kit is designed as a stepping stone to the next-generation NVIDIA Grace Hopper Superchip for HPC and AI…
This version 22.9 update to the NVIDIA HPC SDK includes fixes and minor enhancements.
Organizations are rapidly becoming more advanced in the use of AI, and many are looking to leverage the latest technologies to maximize workload performance and efficiency. One of the most prevalent trends today is the use of CPUs based on Arm architecture to build data center servers. To ensure that these new systems are enterprise-ready and optimally configured, NVIDIA has approved the…
AI processing requires full-stack innovation across hardware and software platforms to address the growing computational demands of neural networks. A key area to drive efficiency is using lower precision number formats to improve computational efficiency, reduce memory usage, and optimize for interconnect bandwidth. To realize these benefits, the industry has moved from 32-bit precisions to…
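To make the idea concrete, here is a minimal sketch, not taken from the article itself, of storing and processing data in 16-bit floating point on the GPU, which halves the per-element memory footprint and bandwidth relative to FP32. The kernel and buffer names are illustrative, and half-precision intrinsics such as __hmul require a GPU of compute capability 5.3 or later.

    // Build (hypothetical file name): nvcc -arch=sm_70 fp16_scale.cu
    #include <cuda_fp16.h>

    // Scale a half-precision array by a scalar.
    __global__ void scale_fp16(const __half* in, __half* out, float alpha, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) {
            // __hmul multiplies two FP16 values natively on sm_53+ GPUs.
            out[i] = __hmul(in[i], __float2half(alpha));
        }
    }

    int main() {
        const int n = 1 << 20;
        __half *d_in, *d_out;
        cudaMalloc(&d_in,  n * sizeof(__half));   // 2 bytes per element vs. 4 for float
        cudaMalloc(&d_out, n * sizeof(__half));
        scale_fp16<<<(n + 255) / 256, 256>>>(d_in, d_out, 0.5f, n);
        cudaDeviceSynchronize();
        cudaFree(d_in);
        cudaFree(d_out);
        return 0;
    }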
This release includes enhancements, fixes, and new support for Arm SVE, Rocky Linux OS, and Amazon EC2 C7g instances, powered by the latest-generation AWS Graviton3 processors.
Today at AWS re:Invent 2021, AWS announced the general availability of Amazon EC2 G5g instances, bringing the first NVIDIA GPU-accelerated Arm-based instance to the AWS cloud. The new EC2 G5g instance features AWS Graviton2 processors, based on 64-bit Arm Neoverse cores, and NVIDIA T4G Tensor Core GPUs, enhanced for graphics-intensive applications. This powerful combination creates an…
In July of 2021, NVIDIA announced the availability of the NVIDIA Arm HPC Developer Kit for preordering, along with the NVIDIA HPC SDK. Since then, NVIDIA and its partners have been working hard to get units into the hands of developers, increase global availability, and enhance the software stack. The NVIDIA Arm HPC Developer Kit is based on the GIGABYTE G242-P32 2U server.
AI continues to drive breakthrough innovation across industries, including consumer Internet, healthcare and life sciences, financial services, retail, manufacturing, and supercomputing. Researchers continue to push the boundaries of what's possible with rapidly evolving models that are growing in size, complexity, and diversity. In addition, many of these complex, large-scale models need to…
Today NVIDIA announced the availability of the NVIDIA Arm HPC Developer Kit with the NVIDIA HPC SDK version 21.7. The DevKit is an integrated hardware-software platform for creating, evaluating, and benchmarking HPC, AI, and scientific computing applications for Arm server-based accelerated platforms. The HPC SDK v21.7 is the latest update of the software development kit and fully supports the…
Get the latest resources and news about the NVIDIA technologies that are accelerating the latest innovations in HPC from industry leaders and developers. Explore sessions and demos across a variety of HPC topics, ranging from weather forecasting and energy exploration to computational chemistry and molecular dynamics. The developer resources listed below are exclusively available to NVIDIA…
Researchers are harnessing the power of NVIDIA GPUs more than ever before to find a cure for COVID-19. Leveraging popular molecular dynamics and quantum chemistry HPC applications, they are running thousands of experiments to predict which compounds can effectively bind with proteins and block the virus from infecting our cells. NGC has recently introduced updated versions of these popular…
The world's ultimate embedded solution for AI developers, Jetson AGX Xavier, is now shipping as standalone production modules from NVIDIA. A member of NVIDIA's AGX Systems for autonomous machines, Jetson AGX Xavier is ideal for deploying advanced AI and computer vision to the edge, enabling robotic platforms in the field with workstation-level performance and the ability to operate fully…
NVIDIA Nsight Eclipse Edition is a full-featured, integrated development environment that lets you easily develop CUDA applications for either your local (x86) system or a remote (x86 or Arm) target. In this post, I will walk you through the process of remote-developing CUDA applications for the NVIDIA Jetson TX2, an Arm-based development kit. Note that this how-to also applies to Jetson TX1 and…
Today at an AI meetup in San Francisco, NVIDIA launched Jetson TX2 and the JetPack 3.0 AI SDK. Jetson is the world's leading low-power embedded platform, enabling server-class AI compute performance for edge devices everywhere. Jetson TX2 features an integrated 256-core NVIDIA Pascal GPU, a hex-core ARMv8 64-bit CPU complex, and 8GB of LPDDR4 memory with a 128-bit interface.
GPUs have quickly become the go-to platform for accelerating machine learning applications for training and classification. Deep Neural Networks (DNNs) have grown in importance for many applications, from image classification and natural language processing to robotics and UAVs. To help researchers focus on solving core problems, NVIDIA introduced a library of primitives for deep neural networks…
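As a rough sketch of what programming against such a primitives library looks like, the snippet below runs a ReLU activation forward pass over a 4D tensor using the cuDNN API. The tensor shape and pointer names are illustrative, not from the announcement.

    // Build (hypothetical file name): nvcc relu.cu -lcudnn
    #include <cudnn.h>

    int main() {
        // One handle per host thread; all cuDNN calls go through it.
        cudnnHandle_t handle;
        cudnnCreate(&handle);

        // Describe a 1x32x64x64 NCHW float tensor.
        cudnnTensorDescriptor_t desc;
        cudnnCreateTensorDescriptor(&desc);
        cudnnSetTensor4dDescriptor(desc, CUDNN_TENSOR_NCHW, CUDNN_DATA_FLOAT, 1, 32, 64, 64);

        // Configure a ReLU activation.
        cudnnActivationDescriptor_t act;
        cudnnCreateActivationDescriptor(&act);
        cudnnSetActivationDescriptor(act, CUDNN_ACTIVATION_RELU, CUDNN_PROPAGATE_NAN, 0.0);

        size_t bytes = 1 * 32 * 64 * 64 * sizeof(float);
        float *d_x, *d_y;
        cudaMalloc(&d_x, bytes);
        cudaMalloc(&d_y, bytes);

        // y = relu(x), computed on the GPU by the library.
        const float alpha = 1.0f, beta = 0.0f;
        cudnnActivationForward(handle, act, &alpha, desc, d_x, &beta, desc, d_y);

        cudaFree(d_x);
        cudaFree(d_y);
        cudnnDestroyActivationDescriptor(act);
        cudnnDestroyTensorDescriptor(desc);
        cudnnDestroy(handle);
        return 0;
    }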
Today we're excited to announce the release of the CUDA Toolkit version 6.5. CUDA 6.5 adds a number of features and improvements to the CUDA platform, including support for CUDA Fortran in developer tools, user-defined callback functions in cuFFT, new occupancy calculator APIs, and more. Last year we introduced CUDA on Arm, and in March we released the Jetson TK1 developer board…
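The occupancy calculator APIs mentioned above let a program pick launch parameters at run time instead of hard-coding them. A minimal sketch, using an illustrative SAXPY kernel:

    // Build (hypothetical file name): nvcc occupancy.cu
    #include <cstdio>

    __global__ void saxpy(float a, const float* x, float* y, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) y[i] = a * x[i] + y[i];
    }

    int main() {
        int minGridSize = 0, blockSize = 0;
        // Suggest a block size that maximizes occupancy for this kernel.
        cudaOccupancyMaxPotentialBlockSize(&minGridSize, &blockSize, saxpy, 0, 0);

        int numBlocks = 0;
        // How many blocks of that size can be resident per multiprocessor?
        cudaOccupancyMaxActiveBlocksPerMultiprocessor(&numBlocks, saxpy, blockSize, 0);

        printf("suggested block size: %d, max active blocks/SM: %d\n", blockSize, numBlocks);
        return 0;
    }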
NVIDIA's Tegra K1 (TK1) is the first Arm system-on-chip (SoC) with integrated CUDA. With 192 Kepler GPU cores and four Arm Cortex-A15 cores delivering a total of 327 GFLOPS of compute performance, TK1 has the capacity to process lots of data with CUDA while typically drawing less than 6W of power (including the SoC and DRAM). This brings game-changing performance to low-SWaP (Size…
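For context, that peak figure lines up with straightforward arithmetic, assuming the TK1's published maximum GPU clock of roughly 852 MHz: 192 CUDA cores × 2 floating-point operations per fused multiply-add per clock × 0.852 GHz ≈ 327 single-precision GFLOPS.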
NVIDIA Nsight Eclipse Edition is a full-featured, integrated development environment that lets you easily develop CUDA applications for either your local (x86) system or a remote (x86 or Arm) target. In this post, I will walk you through the process of remote-developing CUDA applications for the NVIDIA Jetson TK1, an Arm-based development kit. Nsight supports two remote development modes: cross…
In CUDACast #5, we saw how to use the new NVIDIA RPM and Debian packages to install the CUDA toolkit, samples, and driver on a supported Linux OS with a standard package manager. With CUDA 5.5, it is now possible to compile and run CUDA applications on Arm-based systems such as the Kayla development platform. In addition to native compilation on an Arm-based CPU system, it is also possible to…
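For reference, a self-contained program like the following is the kind of thing that can then be compiled and run natively on an Arm system once the toolkit is installed. The file and kernel names are illustrative, and a plain nvcc invocation is assumed.

    // Build (hypothetical file name): nvcc hello.cu -o hello
    #include <cstdio>

    __global__ void add_one(int* data, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) data[i] += 1;
    }

    int main() {
        const int n = 16;
        int h[n] = {0};
        int* d = nullptr;
        cudaMalloc(&d, n * sizeof(int));
        cudaMemcpy(d, h, n * sizeof(int), cudaMemcpyHostToDevice);
        add_one<<<1, n>>>(d, n);
        cudaMemcpy(h, d, n * sizeof(int), cudaMemcpyDeviceToHost);
        printf("h[0] = %d\n", h[0]);  // expect 1
        cudaFree(d);
        return 0;
    }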