State-of-the-art image diffusion models take tens of seconds to process a single image. This makes video diffusion even more challenging, as it demands significant computational resources and drives up costs. By leveraging the latest FP8 quantization features on NVIDIA Hopper GPUs with NVIDIA TensorRT, it’s possible to significantly reduce inference costs and serve more users with fewer GPUs.
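As a rough sketch of where FP8 comes in at engine-build time, the snippet below uses the TensorRT Python builder API. The unet.onnx file name is a placeholder, and it assumes TensorRT 10 (explicit batch by default), a Hopper-class GPU, and an ONNX graph that already carries FP8 quantize/dequantize nodes from a quantization toolkit; the builder flag only allows FP8 kernels to be selected.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)

# Explicit-batch network (the default in TensorRT 10).
network = builder.create_network(0)
parser = trt.OnnxParser(network, logger)
if not parser.parse_from_file("unet.onnx"):  # placeholder model path
    raise RuntimeError("Failed to parse ONNX model")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)  # keep FP16 as a fallback precision
config.set_flag(trt.BuilderFlag.FP8)   # allow FP8 kernels (Hopper, TensorRT >= 9)

# Serialize the engine so it can be deployed without rebuilding.
plan = builder.build_serialized_network(network, config)
with open("unet_fp8.plan", "wb") as f:
    f.write(plan)
```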
NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX GPUs. Now, deploying TensorRT into apps has gotten even easier with prebuilt TensorRT engines. The newly released TensorRT 10.0 with weight-stripped engines offers a unique solution for minimizing the engine shipment size by reducing…
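A minimal sketch of that workflow, assuming the TensorRT 10 Python API (BuilderFlag.STRIP_PLAN and OnnxParserRefitter) and a placeholder model.onnx: the engine is built without weights, then refitted from the original ONNX file on the target machine.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)

# Build an engine with the weights stripped out of the serialized plan.
builder = trt.Builder(logger)
network = builder.create_network(0)
parser = trt.OnnxParser(network, logger)
parser.parse_from_file("model.onnx")  # placeholder model path

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.STRIP_PLAN)  # ship a much smaller, weight-less plan
plan = builder.build_serialized_network(network, config)

# On the target machine: deserialize and refit weights from the original ONNX file.
runtime = trt.Runtime(logger)
engine = runtime.deserialize_cuda_engine(plan)
refitter = trt.Refitter(engine, logger)
parser_refitter = trt.OnnxParserRefitter(refitter, logger)
assert parser_refitter.refit_from_file("model.onnx")
assert refitter.refit_cuda_engine()
```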
Video quality metrics are used to evaluate the fidelity of video content. They provide a consistent quantitative measurement to assess the performance of the encoder. VMAF (Video Multimethod Assessment Fusion) combines human vision modeling with machine learning techniques that are continuously evolving, enabling it to adapt to new content. VMAF excels in aligning with human visual perception by combining detailed analysis…
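One common way to measure VMAF in practice is FFmpeg’s libvmaf filter. The sketch below simply invokes ffmpeg from Python; it assumes an FFmpeg build with libvmaf enabled, placeholder file names, and the JSON log layout of recent libvmaf versions.

```python
import json
import subprocess

def vmaf_score(distorted: str, reference: str, log_path: str = "vmaf.json") -> float:
    """Return the pooled mean VMAF score of `distorted` against `reference`."""
    cmd = [
        "ffmpeg", "-i", distorted, "-i", reference,
        "-lavfi", f"libvmaf=log_fmt=json:log_path={log_path}",
        "-f", "null", "-",
    ]
    subprocess.run(cmd, check=True, capture_output=True)
    with open(log_path) as f:
        # Field names assume a recent libvmaf JSON log.
        return json.load(f)["pooled_metrics"]["vmaf"]["mean"]

print(vmaf_score("encoded.mp4", "reference.mp4"))
```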
This post is the fifth in a series about optimizing end-to-end AI. NVIDIA TensorRT is a solution for speed-of-light inference deployment on NVIDIA hardware. Provided with an AI model architecture, TensorRT can be used pre-deployment to run an exhaustive search for the most efficient execution strategy. TensorRT optimizations include reordering operations in a graph…
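The sketch below, assuming the TensorRT Python API and a placeholder model.onnx, shows two knobs around that search: the builder optimization level, which trades build time for a wider tactic search, and a timing cache that persists the profiling results so later builds can skip them.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(0)
parser = trt.OnnxParser(network, logger)
parser.parse_from_file("model.onnx")  # placeholder model path

config = builder.create_builder_config()
config.builder_optimization_level = 3  # higher levels search more kernel tactics (TRT >= 8.6)

# Cache the results of the tactic search so subsequent builds are faster.
cache = config.create_timing_cache(b"")
config.set_timing_cache(cache, ignore_mismatch=False)

plan = builder.build_serialized_network(network, config)
with open("model.timing.cache", "wb") as f:
    f.write(cache.serialize())
```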
This post is the fourth in a series about optimizing end-to-end AI. As explained in the previous post in the End-to-End AI for NVIDIA-Based PCs series, there are multiple execution providers (EPs) in ONNX Runtime that enable the use of hardware-specific features or optimizations for a given deployment scenario. This post covers the CUDA EP and TensorRT EP using the highly optimized NVIDIA…
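For context, registering those EPs from Python looks roughly like the snippet below; the model path, input shape, and the trt_fp16_enable option are illustrative. ONNX Runtime assigns graph nodes to the providers in the order given, falling back when an operator is not supported.

```python
import numpy as np
import onnxruntime as ort

# Prefer the TensorRT EP, then the CUDA EP, then the CPU fallback.
providers = [
    ("TensorrtExecutionProvider", {"trt_fp16_enable": True}),
    "CUDAExecutionProvider",
    "CPUExecutionProvider",
]
session = ort.InferenceSession("model.onnx", providers=providers)

input_name = session.get_inputs()[0].name
dummy = np.random.rand(1, 3, 224, 224).astype(np.float32)  # placeholder input shape
outputs = session.run(None, {input_name: dummy})
print(outputs[0].shape)
```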