Posts by Shashank Verma
Generative AI
May 12, 2025
Run Hugging Face Models Instantly with Day-0 Support from NVIDIA NeMo Framework
As organizations strive to maximize the value of their generative AI investments, accessing the latest model developments is crucial to continued success. By...
6 MIN READ
Generative AI
Apr 23, 2025
Enhance Your AI Agent with Data Flywheels Using NVIDIA NeMo Microservices
Enterprise data is constantly changing. This presents significant challenges for maintaining AI system accuracy over time. As organizations increasingly rely on...
12 MIN READ
Generative AI
Feb 12, 2025
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Model pruning and knowledge distillation are powerful cost-effective strategies for obtaining smaller language models from an initial larger sibling. ...
10 MIN READ
Generative AI
Aug 16, 2024
Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4-340B
This post was updated on August 16, 2024 to reflect the most recent Reward Bench results. Since the introduction and subsequent wide adoption of large language...
8 MIN READ
Generative AI
Jul 10, 2024
Customizing NVIDIA NIM for Domain-Specific Needs with NVIDIA NeMo
Large language models (LLMs) adopted for specific enterprise applications most often benefit from model customization. Enterprises need to tailor ?LLMs for...
11 MIN READ
Generative AI
Jun 07, 2024
Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM
The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...
11 MIN READ