Nsight Compute 2024.3 improves selectively exporting results into a new report, kernel name logging to debug empty reports, and profiling green contexts.
]]>The latest release of CUDA Toolkit, version 12.4, continues to push accelerated computing performance using the latest NVIDIA GPUs. This post explains the new features and enhancements included in this release: CUDA and the CUDA Toolkit software provide the foundation for all NVIDIA GPU-accelerated computing applications in data science and analytics, machine learning…
]]>The latest release of CUDA Toolkit continues to push the envelope of accelerated computing performance using the latest NVIDIA GPUs. New features of this release, version 12.3, include: CUDA and the CUDA Toolkit continue to provide the foundation for all accelerated computing applications in data science, machine learning and deep learning, generative AI with LLMs for both training and…
]]>GPU acceleration is enabling faster and more intelligent applications than ever before, and the CUDA Toolkit is key to harnessing acceleration on NVIDIA GPUs. But debugging, profiling, and optimizing CUDA can be a challenge, especially if you are unable to inspect hardware-level throughput and performance. To help you harness CUDA acceleration, NVIDIA offers Nsight Developer Tools.
]]>The latest release of CUDA Toolkit 12.2 introduces a range of essential new features, modifications to the programming model, and enhanced support for hardware capabilities accelerating CUDA applications. Now out through general availability from NVIDIA, CUDA Toolkit 12.2 includes many new capabilities, both major and minor. The following post offers an overview of many of the key…
]]>NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user interface and a command-line tool. Nsight Compute 2022.2 includes features to expand the supported environments and workflows for CUDA kernel profiling and optimization. Download now. >> The following outlines the feature highlights of…
]]>NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user interface and a command-line tool. Nsight Compute 2022.1 brings updates to improve data collection modes enabling new use cases and options for performance profiling. Download Now>> This release of Nsight Compute extends the…
]]>NVIDIA announced the latest Nsight Compute 2021.3 with new features for measuring and modeling occupancy, source and assembly code correlation, and a hierarchical roofline model to identify bottlenecks caused by accessing cache memory. Nsight Compute 2021.3 adds a new Occupancy Calculator activity that helps you understand the hardware resource utilization of their kernels and model how…
]]>Writing high-performance software is no simple task. After you have code that can compile and run, a new challenge is introduced when you try and understand how it is performing on the available hardware. Different platforms, whether they are CPUs, GPUs, or something else, will have different hardware limitations like available memory bandwidth and theoretical compute limits.
]]>Containers are now ubiquitous, and for good reason; the portability and productivity enhancements they provide have made them a standard component in HPC and many other computing fields. The NVIDIA Nsight family of developer tools for analyzing performance of CUDA applications are supported in container environments. For more information about the environmental landscape and Nsight Systems…
]]>The NVIDIA Ampere GPU architecture has arrived! It’s time to make sure that your applications are getting the most out of the powerful compute resources in this new architecture. With the release of CUDA 11, we are adding several features to the Nsight family of Developer Tools to help you do just that. These additions improve usability, productivity, and make it easier for you to find bugs…
]]>