Joohoon Lee – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2023-06-12T21:06:31Z http://www.open-lab.net/blog/feed/ Joohoon Lee <![CDATA[Optimizing T5 and GPT-2 for Real-Time Inference with NVIDIA TensorRT]]> http://www.open-lab.net/blog/?p=41964 2023-06-12T21:06:31Z 2021-12-02T17:00:00Z The transformer architecture has wholly transformed (pun intended) the domain of natural language processing (NLP). Over the recent years, many novel network...]]>

Join the NVIDIA Triton and NVIDIA TensorRT community to stay current on the latest product updates, bug fixes, content, best practices, and more. The transformer architecture has wholly transformed (pun intended) the domain of natural language processing (NLP). Over the recent years, many novel network architectures have been built on the transformer building blocks: BERT, GPT, and T5…

Source

]]>
4
Joohoon Lee <![CDATA[Fast INT8 Inference for Autonomous Vehicles with TensorRT 3]]> http://www.open-lab.net/blog/parallelforall/?p=8755 2022-08-21T23:38:35Z 2017-12-12T01:25:40Z Autonomous driving demands safety, and a high-performance computing solution to process sensor data with extreme accuracy. Researchers and developers creating...]]>

Autonomous driving demands safety, and a high-performance computing solution to process sensor data with extreme accuracy. Researchers and developers creating deep neural networks (DNNs) for self driving must optimize their networks to ensure low-latency inference and energy efficiency. Thanks to a new Python API in NVIDIA TensorRT, this process just became easier. TensorRT is a high…

Source

]]>
6
���˳���97caoporen����