Sylendran Arunagiri – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-07-24T18:33:37Z http://www.open-lab.net/blog/feed/ Sylendran Arunagiri <![CDATA[Reinforcement Learning with NVIDIA NeMo-RL: Reproducing a DeepScaleR Recipe Using GRPO]]> http://www.open-lab.net/blog/?p=102915 2025-07-24T18:33:16Z 2025-07-09T19:00:00Z Reinforcement learning (RL) is the backbone of interactive AI. It is fundamental for teaching agents to reason and learn from human preferences, enabling...]]>

Reinforcement learning (RL) is the backbone of interactive AI. It is fundamental for teaching agents to reason and learn from human preferences, enabling multiturn tool use, and much more. This post introduces NVIDIA NeMo-RL, a new open source post-training library that is built to support everything from single-GPU prototypes to thousand-GPU large models and to orchestrate multicomponent RL…

Source

]]>
Sylendran Arunagiri <![CDATA[New Video: Build Self-Improving AI Agents with the NVIDIA Data Flywheel Blueprint]]> http://www.open-lab.net/blog/?p=102853 2025-07-24T18:33:21Z 2025-07-03T18:41:57Z AI agents powered by large language models are transforming enterprise workflows, but high inference costs and latency can limit their scalability and user...]]>

AI agents powered by large language models are transforming enterprise workflows, but high inference costs and latency can limit their scalability and user experience. To address this, NVIDIA recently announced the NVIDIA AI Blueprint for Building Data Flywheels. It’s an enterprise-ready workflow that helps optimize AI agents by automated experimentation to find efficient models that reduce…

Source

]]>
Sylendran Arunagiri <![CDATA[Upcoming Livestream: Beyond the Algorithm With NVIDIA]]> http://www.open-lab.net/blog/?p=102468 2025-07-24T18:33:37Z 2025-06-24T15:00:00Z Join us on June 26 to learn how to distill cost-efficient models with the NVIDIA Data Flywheel Blueprint.]]>

Join us on June 26 to learn how to distill cost-efficient models with the NVIDIA Data Flywheel Blueprint.

Source

]]>
Sylendran Arunagiri <![CDATA[Build Efficient AI Agents Through Model Distillation With the NVIDIA Data Flywheel Blueprint]]> http://www.open-lab.net/blog/?p=101424 2025-06-12T18:48:35Z 2025-06-11T11:00:00Z As enterprise adoption of agentic AI accelerates, teams face a growing challenge of scaling intelligent applications while managing inference costs. Large...]]>

As enterprise adoption of agentic AI accelerates, teams face a growing challenge of scaling intelligent applications while managing inference costs. Large language models (LLMs) offer strong performance but come with substantial computational demands, often resulting in high latency and costs. At the same time, many development workflows—such as evaluation, data curation…

Source

]]>
Sylendran Arunagiri <![CDATA[Maximize AI Agent Performance with Data Flywheels Using NVIDIA NeMo Microservices]]> http://www.open-lab.net/blog/?p=97046 2025-04-23T00:15:03Z 2025-03-18T19:05:30Z As agentic AI systems evolve and become essential for optimizing business processes, it is crucial for developers to update them regularly to stay aligned with...]]>

As agentic AI systems evolve and become essential for optimizing business processes, it is crucial for developers to update them regularly to stay aligned with ever-changing business and user needs. Continuously refining these agents with AI and human feedback ensures that they remain effective and relevant. NVIDIA NeMo microservices is a fully accelerated, enterprise-grade solution designed…

Source

]]>
���˳���97caoporen����