Jackson Marusarz – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2024-08-28T17:56:46Z http://www.open-lab.net/blog/feed/ Jackson Marusarz <![CDATA[Just Released: Nsight Compute 2024.3]]> http://www.open-lab.net/blog/?p=86723 2024-08-28T17:28:41Z 2024-08-02T15:30:00Z Nsight Compute 2024.3 improves selectively exporting results into a new report, kernel name logging to debug empty reports, and profiling green contexts.]]>

Nsight Compute 2024.3 improves selectively exporting results into a new report, kernel name logging to debug empty reports, and profiling green contexts.

Source

]]>
Jackson Marusarz <![CDATA[CUDA Toolkit 12.4 Enhances Support for NVIDIA Grace Hopper and Confidential Computing]]> http://www.open-lab.net/blog/?p=79119 2024-08-28T17:32:44Z 2024-03-06T19:55:00Z The latest release of CUDA Toolkit, version 12.4, continues to push accelerated computing performance using the latest NVIDIA GPUs. This post explains the new...]]>

The latest release of CUDA Toolkit, version 12.4, continues to push accelerated computing performance using the latest NVIDIA GPUs. This post explains the new features and enhancements included in this release: CUDA and the CUDA Toolkit software provide the foundation for all NVIDIA GPU-accelerated computing applications in data science and analytics, machine learning…

Source

]]>
Jackson Marusarz <![CDATA[CUDA Toolkit 12.3 Delivers New Features for Accelerated Computing]]> http://www.open-lab.net/blog/?p=71735 2024-08-28T17:33:55Z 2023-11-01T16:00:00Z The latest release of CUDA Toolkit continues to push the envelope of accelerated computing performance using the latest NVIDIA GPUs. New features of this...]]>

The latest release of CUDA Toolkit continues to push the envelope of accelerated computing performance using the latest NVIDIA GPUs. New features of this release, version 12.3, include: CUDA and the CUDA Toolkit continue to provide the foundation for all accelerated computing applications in data science, machine learning and deep learning, generative AI with LLMs for both training and…

Source

]]>
0
Jackson Marusarz <![CDATA[New Video Series: CUDA Developer Tools Tutorials]]> http://www.open-lab.net/blog/?p=71058 2024-08-28T17:35:38Z 2023-09-25T17:00:00Z GPU acceleration is enabling faster and more intelligent applications than ever before, and the CUDA Toolkit is key to harnessing acceleration on NVIDIA GPUs....]]>

GPU acceleration is enabling faster and more intelligent applications than ever before, and the CUDA Toolkit is key to harnessing acceleration on NVIDIA GPUs. But debugging, profiling, and optimizing CUDA can be a challenge, especially if you are unable to inspect hardware-level throughput and performance. To help you harness CUDA acceleration, NVIDIA offers Nsight Developer Tools.

Source

]]>
0
Jackson Marusarz <![CDATA[CUDA Toolkit 12.2 Unleashes Powerful Features for Boosting Applications]]> http://www.open-lab.net/blog/?p=67705 2024-08-28T17:39:00Z 2023-07-06T19:16:56Z The latest release of CUDA Toolkit 12.2 introduces a range of essential new features, modifications to the programming model, and enhanced support for hardware...]]>

The latest release of CUDA Toolkit 12.2 introduces a range of essential new features, modifications to the programming model, and enhanced support for hardware capabilities accelerating CUDA applications. Now out through general availability from NVIDIA, CUDA Toolkit 12.2 includes many new capabilities, both major and minor. The following post offers an overview of many of the key…

Source

]]>
0
Jackson Marusarz <![CDATA[Improve Guidance and Performance Visualization with the New Nsight Compute]]> http://www.open-lab.net/blog/?p=48546 2024-08-28T17:45:26Z 2022-05-31T16:00:00Z NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user...]]>

NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user interface and a command-line tool. Nsight Compute 2022.2 includes features to expand the supported environments and workflows for CUDA kernel profiling and optimization. Download now. >> The following outlines the feature highlights of…

Source

]]>
0
Jackson Marusarz <![CDATA[Advanced Kernel Profiling with the Latest Nsight Compute]]> http://www.open-lab.net/blog/?p=43640 2024-08-28T17:46:06Z 2022-01-27T17:58:33Z NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user...]]>

NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user interface and a command-line tool. Nsight Compute 2022.1 brings updates to improve data collection modes enabling new use cases and options for performance profiling. Download Now>> This release of Nsight Compute extends the…

Source

]]>
0
Jackson Marusarz <![CDATA[Optimizing GPU Utilization with Nsight Compute 2021.3]]> http://www.open-lab.net/blog/?p=39142 2024-08-28T17:47:12Z 2021-10-26T05:06:50Z NVIDIA announced the latest Nsight Compute 2021.3 with new features for measuring and modeling occupancy, source and assembly code correlation, and a...]]>

NVIDIA announced the latest Nsight Compute 2021.3 with new features for measuring and modeling occupancy, source and assembly code correlation, and a hierarchical roofline model to identify bottlenecks caused by accessing cache memory. Nsight Compute 2021.3 adds a new Occupancy Calculator activity that helps you understand the hardware resource utilization of their kernels and model how…

Source

]]>
0
Jackson Marusarz <![CDATA[Accelerating HPC Applications with NVIDIA Nsight Compute Roofline Analysis]]> http://www.open-lab.net/blog/?p=22103 2024-08-28T17:55:11Z 2020-11-18T16:00:00Z Writing high-performance software is no simple task. After you have code that can compile and run, a new challenge is introduced when you try and understand how...]]>

Writing high-performance software is no simple task. After you have code that can compile and run, a new challenge is introduced when you try and understand how it is performing on the available hardware. Different platforms, whether they are CPUs, GPUs, or something else, will have different hardware limitations like available memory bandwidth and theoretical compute limits.

Source

]]>
2
Jackson Marusarz <![CDATA[Using NVIDIA Nsight Compute in Containers]]> http://www.open-lab.net/blog/?p=19664 2024-08-28T17:55:43Z 2020-08-14T20:11:36Z Containers are now ubiquitous, and for good reason; the portability and productivity enhancements they provide have made them a standard component in HPC and...]]>

Containers are now ubiquitous, and for good reason; the portability and productivity enhancements they provide have made them a standard component in HPC and many other computing fields. The NVIDIA Nsight family of developer tools for analyzing performance of CUDA applications are supported in container environments. For more information about the environmental landscape and Nsight Systems…

Source

]]>
0
Jackson Marusarz <![CDATA[Unleashing the Power of NVIDIA Ampere Architecture with NVIDIA Nsight Developer Tools]]> http://www.open-lab.net/blog/?p=17447 2024-08-28T17:56:46Z 2020-05-14T13:00:00Z The NVIDIA Ampere GPU architecture has arrived! It��s time to make sure that your applications are getting the most out of the powerful compute resources in...]]>

The NVIDIA Ampere GPU architecture has arrived! It’s time to make sure that your applications are getting the most out of the powerful compute resources in this new architecture. With the release of CUDA 11, we are adding several features to the Nsight family of Developer Tools to help you do just that. These additions improve usability, productivity, and make it easier for you to find bugs…

Source

]]>
0
���˳���97caoporen����