Jaydeep Marathe – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-05-29T19:05:10Z http://www.open-lab.net/blog/feed/ Jaydeep Marathe <![CDATA[CUDA C++ Compiler Updates Impacting ELF Visibility and Linkage]]> http://www.open-lab.net/blog/?p=99693 2025-05-29T19:05:10Z 2025-05-09T16:51:02Z In the next CUDA major release, CUDA 13.0, NVIDIA is introducing two significant changes to the NVIDIA CUDA Compiler Driver (NVCC) that will impact ELF...]]>

Jaydeep Marathe <![CDATA[Reducing Application Build Times Using CUDA C++ Compilation Aids]]> http://www.open-lab.net/blog/?p=38989 2022-08-21T23:52:55Z 2021-10-26T05:04:15Z The CUDA 11.5 C++ compiler addresses a growing customer request: how to reduce CUDA application build times. Along with eliminating unused...]]>

Jaydeep Marathe <![CDATA[Programming Efficiently with the NVIDIA CUDA 11.3 Compiler Toolchain]]> http://www.open-lab.net/blog/?p=29901 2023-12-30T00:42:34Z 2021-04-16T00:40:00Z The CUDA 11.3 release of the CUDA C++ compiler toolchain incorporates new features aimed at improving developer productivity and code performance. NVIDIA is...]]>

The CUDA 11.3 release of the CUDA C++ compiler toolchain incorporates new features aimed at improving developer productivity and code performance. NVIDIA is introducing cu++filt, a standalone demangler tool that allows you to decode mangled function names to aid source code correlation. Starting with this release, the NVRTC shared library versioning scheme is relaxed to facilitate compatible…

Jaydeep Marathe <![CDATA[Boosting Productivity and Performance with the NVIDIA CUDA 11.2 C++ Compiler]]> http://www.open-lab.net/blog/?p=23916 2022-08-21T23:41:02Z 2021-02-13T02:30:28Z The 11.2 CUDA C++ compiler incorporates features and enhancements aimed at improving developer productivity and the performance of GPU-accelerated applications....]]>

The 11.2 CUDA C++ compiler incorporates features and enhancements aimed at improving developer productivity and the performance of GPU-accelerated applications. The compiler toolchain gets an LLVM upgrade to 7.0, which enables new features and can help improve compiler code generation for NVIDIA GPUs. Link-time optimization (LTO) for device code (also known as device LTO)…

Jaydeep Marathe <![CDATA[New Compiler Features in CUDA 8]]> http://www.open-lab.net/blog/parallelforall/?p=7346 2022-08-21T23:38:01Z 2016-11-08T07:14:00Z CUDA 8 is one of the most significant updates in the history of the CUDA platform. In addition to Unified Memory and the many new API and library features in...]]>
