Zhiyong Ban – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-02-17T05:23:38Z http://www.open-lab.net/blog/feed/ Zhiyong Ban <![CDATA[Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2]]> http://www.open-lab.net/blog/?p=82196 2025-02-17T05:23:38Z 2024-05-13T17:17:38Z In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...]]>

In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo, and evaluating its performance. In this post, we walk you through curating a custom dataset and fine-tuning the model on that dataset. Custom data collection is crucial in model fine-tuning because it enables a model to adapt to…

Source

]]>
Zhiyong Ban <![CDATA[Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1]]> http://www.open-lab.net/blog/?p=82195 2024-05-30T19:55:58Z 2024-05-13T17:15:13Z Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...]]>

Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of attention-based transformer models has had a profound impact on complicated language modeling tasks, which predict the next upcoming token in the sentence. NMT is one of the typical instances. There are plenty of open-source NMT models…

Source

]]>
���˳���97caoporen����