As models grow larger and are trained on more data, they become more capable, and therefore more useful. Training these models quickly requires far more computing performance, delivered at data center scale. The NVIDIA Blackwell platform, launched at GTC 2024 and now in full production, integrates six types of chips: GPU, CPU, DPU, NVLink Switch chip, InfiniBand Switch, and Ethernet Switch.
Generative AI models have a variety of uses, such as helping write computer code, crafting stories, composing music, generating images, producing videos, and more. And, as these models continue to grow in size and are trained on even more data, they are producing even higher-quality outputs. Building and deploying these more intelligent models is incredibly compute-intensive…
Generative AI is rapidly transforming computing, unlocking new use cases and turbocharging existing ones. Large language models (LLMs), such as OpenAI’s GPT models and Meta’s Llama 2, skillfully perform a variety of tasks on text-based content. These tasks include summarization, translation, classification, and generation of new content such as computer code, marketing copy, poetry, and much more.
At the heart of the rapidly expanding set of AI-powered applications are powerful AI models. Before these models can be deployed, they must be trained through a process that requires an immense amount of AI computing power. AI training is also an ongoing process, with models constantly retrained with new data to ensure high-quality results. Faster model training means that AI-powered applications…
As the fusion of AI and simulation accelerates scientific discovery, the need has arisen for a way to measure and rank the speed and throughput with which the world’s supercomputers build AI models. MLPerf HPC, now in its third iteration, has emerged as an industry-standard measure of system performance on workloads traditionally performed on supercomputers.
MLPerf benchmarks, developed by MLCommons, are critical evaluation tools that organizations use to measure machine learning training performance across workloads. MLPerf Training v2.1, the seventh iteration of this AI training-focused benchmark suite, tested performance across a breadth of popular AI use cases, including the following: Many AI applications take advantage of…
MLPerf benchmarks are developed by a consortium of AI leaders across industry, academia, and research labs, with the aim of providing standardized, fair, and useful measures of deep learning performance. MLPerf Training focuses on measuring the time to train a range of commonly used neural networks for the following tasks: Lower training times are important to speed time to deployment…
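As context for how these benchmarks are scored, the sketch below shows the general shape of a time-to-train measurement: run training until a target quality metric is reached, then report the elapsed wall-clock time. It is a minimal illustration, not MLPerf harness code; `train_one_epoch` and `evaluate` are hypothetical stand-ins, and the quality numbers are simulated.

```python
import time

def train_one_epoch(state):
    # Hypothetical stand-in for one full pass over the training data;
    # here, model quality simply improves by a fixed amount per epoch.
    state["quality"] += 0.05

def evaluate(state):
    # Hypothetical stand-in for a validation pass (e.g., accuracy).
    return state["quality"]

def time_to_train(target_quality, max_epochs=100):
    # MLPerf-style metric: wall-clock time to reach a target quality.
    state = {"quality": 0.0}
    start = time.perf_counter()
    for epoch in range(1, max_epochs + 1):
        train_one_epoch(state)
        if evaluate(state) >= target_quality:
            return epoch, time.perf_counter() - start
    raise RuntimeError("target quality not reached within max_epochs")

epochs, seconds = time_to_train(target_quality=0.75)
print(f"reached target in {epochs} epochs ({seconds:.6f} s)")
```

Because the metric stops the clock only when the quality target is met, it rewards end-to-end system speed rather than raw throughput alone.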
Five months have passed since v1.0, so it is time for another round of the MLPerf Training benchmark. In this v1.1 edition, optimizations across the entire hardware and software stack deliver continuing improvements across the benchmark suite for submissions based on the NVIDIA platform. These improvements are observed consistently at all scales, from single machines all the way to industrial…
In MLPerf HPC v1.0, NVIDIA-powered systems won four of five new industry metrics focused on AI performance in HPC. Developed by an industry-wide AI consortium, MLPerf HPC evaluates a suite of performance benchmarks covering a range of widely used AI workloads. In this round, NVIDIA delivered 5x better results for CosmoFlow and 7x more performance on DeepCAM compared to the strong scaling results from…
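For readers unfamiliar with the term, strong scaling measures how much faster a fixed-size problem completes as more processors are added. The sketch below computes strong-scaling speedup and parallel efficiency from hypothetical timings; the numbers are illustrative only and are not MLPerf HPC results.

```python
def strong_scaling(base_time, base_gpus, scaled_time, scaled_gpus):
    # Speedup S = T(base) / T(scaled) for the same fixed-size problem;
    # efficiency E = S / (scaled_gpus / base_gpus), where 1.0 is ideal.
    speedup = base_time / scaled_time
    efficiency = speedup / (scaled_gpus / base_gpus)
    return speedup, efficiency

# Illustrative timings only (not actual benchmark measurements):
s, e = strong_scaling(base_time=1000.0, base_gpus=64,
                      scaled_time=140.0, scaled_gpus=512)
print(f"speedup: {s:.2f}x, efficiency: {e:.1%}")  # speedup: 7.14x, efficiency: 89.3%
```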
MLPerf is an industry-wide AI consortium tasked with developing a suite of performance benchmarks that cover a range of leading AI workloads in wide use. The latest MLPerf v1.0 training round includes vision, language, recommender systems, and reinforcement learning tasks. It continually evolves to reflect state-of-the-art AI applications. NVIDIA submitted MLPerf v1.0…