Sarah Yurick – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-07-24T18:32:13Z http://www.open-lab.net/blog/feed/ Sarah Yurick <![CDATA[Train a Reasoning-Capable LLM in One Weekend with NVIDIA NeMo]]> http://www.open-lab.net/blog/?p=103512 2025-07-24T18:32:13Z 2025-07-23T01:18:56Z Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful...]]>

Have you ever wanted to build your own reasoning model but thought it was too complicated or required massive resources? Think again. With NVIDIA’s powerful tools and datasets, you can train a small, effective reasoning model in about 48 hours, all on a single GPU. Even better, we’ve made all the code available to you to get started right away. Let’s dive in.

]]>
Sarah Yurick <![CDATA[Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator]]> http://www.open-lab.net/blog/?p=99540 2025-05-29T19:05:18Z 2025-05-07T16:22:31Z Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...]]>

Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable developers to build highly accurate LLMs, NVIDIA previously released Nemotron-CC, a 6.3-trillion-token English language Common Crawl (CC) dataset. Today, the NVIDIA NeMo Curator team is excited to share that the pipeline used to build the…

]]>