Oliver Knieps – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2023-09-07T18:38:16Z http://www.open-lab.net/blog/feed/ Oliver Knieps <![CDATA[Deploying YOLOv5 on NVIDIA Jetson Orin with cuDLA: Quantization-Aware Training to Inference]]> http://www.open-lab.net/blog/?p=69996 2023-09-07T18:38:16Z 2023-08-31T17:00:00Z NVIDIA Jetson Orin is the best-in-class embedded platform for AI workloads. One of the key components of the Orin platform is the second-generation Deep...]]>

NVIDIA Jetson Orin is the best-in-class embedded platform for AI workloads. One of the key components of the Orin platform is the second-generation Deep Learning Accelerator (DLA), the dedicated deep learning inference engine that offers one-third of the AI compute on the AGX Orin platforms. This post is a deep technical dive into how embedded developers working with Orin platforms can…

Source

]]>
0
Oliver Knieps <![CDATA[Maximizing Deep Learning Performance on NVIDIA Jetson Orin with DLA]]> http://www.open-lab.net/blog/?p=69416 2023-08-25T20:40:41Z 2023-08-16T18:04:49Z NVIDIA Jetson Orin is the best-in-class embedded AI platform. The Jetson Orin SoC module has the NVIDIA Ampere architecture GPU at its core but there is a lot...]]>

NVIDIA Jetson Orin is the best-in-class embedded AI platform. The Jetson Orin SoC module has the NVIDIA Ampere architecture GPU at its core but there is a lot more compute on the SoC: The NVIDIA Orin SoC is powerful, with 275 peak AI TOPs, making it the best embedded and automotive AI platform. Did you know that almost 40% of these AI TOPs come from the two DLAs on NVIDIA Orin?

Source

]]>
0
Oliver Knieps <![CDATA[Discovering GPU-friendly Deep Neural Networks with Unified Neural Architecture Search]]> http://www.open-lab.net/blog/?p=21847 2022-08-21T23:40:45Z 2020-11-05T21:29:02Z After the first successes of deep learning, designing neural network architectures with desirable performance criteria for a given task (for example, high...]]>

After the first successes of deep learning, designing neural network architectures with desirable performance criteria for a given task (for example, high accuracy or low latency) has been a challenging problem. Some call it alchemy and some intuition, but the task of discovering a novel architecture often involves a tedious and costly trial-and-error process of searching in an exponentially large…

Source

]]>
0
���˳���97caoporen����