Peter Entschev – NVIDIA Technical Blog

Peter Entschev – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2024-12-20T21:13:45Z http://www.open-lab.net/blog/feed/ Peter Entschev <![CDATA[Accelerating GPU Analytics Using RAPIDS and Ray]]> http://www.open-lab.net/blog/?p=94495 2024-12-20T21:13:45Z 2024-12-20T21:13:42Z

RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and...]]>

RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and Dask. Ray is a popular open-source distributed Python framework commonly used to scale AI and machine learning (ML) applications. Ray particularly excels at simplifying and scaling training and inference pipelines and can easily target both…

]]> Peter Entschev <![CDATA[Best Practices for Multi-GPU Data Analysis Using RAPIDS with Dask]]> http://www.open-lab.net/blog/?p=92480 2024-12-12T19:38:40Z 2024-11-21T19:02:03Z

As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth��multi-gpu training and analysis...]]>

As we move towards a more dense computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth—multi-gpu training and analysis grows in popularity. We need tools and also best practices as developers and practitioners move from CPU to GPU clusters. RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries. These libraries can easily scale-out for…

]]> Peter Entschev <![CDATA[Debugging a Mixed Python and C Language Stack]]> http://www.open-lab.net/blog/?p=63641 2023-06-09T22:28:19Z 2023-04-20T17:00:00Z

Debugging is difficult. Debugging across multiple languages is especially challenging, and debugging across devices often requires a team with varying skill...]]>

Debugging is difficult. Debugging across multiple languages is especially challenging, and debugging across devices often requires a team with varying skill sets and expertise to reveal the underlying problem. Yet projects often require using multiple languages, to ensure high performance where necessary, a user-friendly experience, and compatibility where possible. Unfortunately…

]]> 0 Peter Entschev <![CDATA[High-Performance Python Communication with UCX-Py]]> http://www.open-lab.net/blog/?p=32231 2022-08-21T23:51:45Z 2021-06-03T16:00:00Z

TL;DR UCX/UCX-Py is an accelerated networking library designed for low-latency high-bandwidth transfers for both host and GPU device memory objects. You...]]>

This post was originally published on the RAPIDS AI blog. UCX/UCX-Py is an accelerated networking library designed for low-latency high-bandwidth transfers for both host and GPU device memory objects. You can easily get started by installing through conda (limited to linux-64): RAPIDS is committed to delivering the highest achievable performance for the PyData ecosystem.

]]> 0 ��˳��97caoporen��