Text Generation – NVIDIA Technical Blog

Text Generation – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-07-03T22:20:47Z http://www.open-lab.net/blog/feed/ Kazuki Fujii <![CDATA[Developing a 172B LLM with Strong Japanese Capabilities Using NVIDIA Megatron-LM]]> http://www.open-lab.net/blog/?p=91656 2024-11-14T19:32:44Z 2024-11-11T19:50:19Z

nemo-megatron-mini-beat-promo-li-tw-2048x1024 copy

Generative AI has the ability to create entirely new content that traditional machine learning (ML) methods struggle to produce. In the field of natural language processing (NLP), the advent of large language models (LLMs) specifically has led to many innovative and creative AI use cases. These include customer support chatbots, voice assistants, text summarization and translation...

]]> 0 Maryam Ashoori <![CDATA[IBM's New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient]]> http://www.open-lab.net/blog/?p=90636 2024-11-22T23:09:36Z 2024-10-21T19:15:35Z

IBM Granite Models NVIDIA

Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on domain-specific use cases; the latest IBM Granite models meet or exceed the performance of leading similarly sized open models across both academic and enterprise benchmarks. The developer-friendly Granite 3.0 generative AI models are...

]]> 0 Chintan Patel <![CDATA[New NIM Available: Mistral Large 2 Instruct LLM]]> http://www.open-lab.net/blog/?p=87308 2024-08-22T18:24:59Z 2024-08-13T20:37:24Z

Mistral Large 2407

The new model by Mistral excels at a variety of complex tasks including text summarization, multilingual translation and reasoning, programming, question and answering, and conversational AI.

]]> 0 Hannah Simmons <![CDATA[Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA]]> http://www.open-lab.net/blog/?p=84548 2024-07-10T15:28:34Z 2024-06-26T16:44:52Z

Llama3 NVIDIA API catalog

Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.

]]> 0 Hannah Simmons <![CDATA[Breeze-7B: LLM Specialized for Traditional Chinese]]> http://www.open-lab.net/blog/?p=83334 2024-06-13T19:06:04Z 2024-06-03T17:00:00Z

Breeze_API NVIDIA featured

The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.

]]> 0 Hannah Simmons <![CDATA[BGE-M3: Advanced Multilingual Text Retrieval Model]]> http://www.open-lab.net/blog/?p=83341 2024-06-13T19:06:03Z 2024-06-03T17:00:00Z

BGE_APICatalog NVIDIA featured

Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense, multi-vector, and sparse retrieval for inputs from short sentences to long documents.

]]> 1 Hannah Simmons <![CDATA[Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model]]> http://www.open-lab.net/blog/?p=82533 2024-06-07T21:15:13Z 2024-05-14T18:46:00Z

Google's New PaliGemma Model

With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.

]]> 0 Brad Nemire <![CDATA[Top Retrieval-Augmented Generation (RAG) Sessions at NVIDIA GTC 2024 Sessions]]> http://www.open-lab.net/blog/?p=77562 2024-06-06T16:14:28Z 2024-02-06T19:38:44Z

Retrieval-Augmented Generation Conference Sessions at GTC

Join us in-person or virtually and learn about the power of RAG with insights and best practices from experts at NVIDIA, visionary CEOs, data scientists, and others.

]]> 0 Shashank Verma <![CDATA[Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model]]> http://www.open-lab.net/blog/?p=74346 2024-10-28T22:00:06Z 2023-11-28T18:10:50Z

GenAI - Promo - AWS -DevNews-PRESS-1920x1080

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation enterprise productivity applications, they enhance user efficiency across tasks like programming, copy editing, brainstorming, and answering questions on a wide range of topics. However, these models often struggle with real-time events and...

]]> 0 Zhilin Wang <![CDATA[Announcing HelpSteer: An Open-Source Dataset for Building Helpful LLMs]]> http://www.open-lab.net/blog/?p=73937 2024-01-03T23:48:02Z 2023-11-27T17:00:00Z

Announcing HelpSteer An Open Source Dataset for Building Helpful LLMs

NVIDIA recently announced the NVIDIA NeMo SteerLM technique as part of the NVIDIA NeMo framework. This technique enables users to control large language model (LLM) responses during inference. The developer community has shown great interest in using the approach for building custom LLMs. The NVIDIA NeMo team is now open-sourcing a multi-attribute dataset called Helpfulness SteerLM dataset...

]]> 0 Brad Nemire <![CDATA[Early Bird Pricing Now Open for Hands-on Training at GTC]]> http://www.open-lab.net/blog/?p=73447 2024-06-06T16:22:29Z 2023-11-20T16:00:00Z

gtc24-spring-dli-early-email-thumbnail-600x338

Register for expert-led technical workshops at NVIDIA GTC and save with early bird pricing through February 7, 2024.

]]> 0 Anjali Shah <![CDATA[Mastering LLM Techniques: Training]]> http://www.open-lab.net/blog/?p=73464 2024-01-22T22:05:25Z 2023-11-16T14:00:00Z

llm-visual-mastering-large-language-model-training-2968826-r1

Large language models (LLMs) are a class of generative AI models built using transformer networks that can recognize, summarize, translate, predict, and generate language using very large datasets. LLMs have the promise of transforming society as we know it, yet training these foundation models is incredibly challenging. This blog articulates the basic principles behind LLMs...

]]> 0 Nirmal Kumar Juluru <![CDATA[Build Custom Enterprise-Grade Generative AI with NVIDIA AI Foundation Models]]> http://www.open-lab.net/blog/?p=73688 2024-01-02T18:37:01Z 2023-11-15T16:00:00Z

ngc-ai-summit-blog-2973793-1920x1080

In the realm of generative AI, building enterprise-grade large language models (LLMs) requires expertise collecting high-quality data, setting up the accelerated infrastructure, and optimizing the models. Developers can begin with pretrained models and fine-tune them for their use case, saving time and getting their solutions faster to market. Developers need an easy way to try out models...

]]> 0 Vivienne Zhang <![CDATA[NVIDIA AI Foundation Models: Build Custom Enterprise Chatbots and Co-Pilots with Production-Ready LLMs]]> http://www.open-lab.net/blog/?p=73296 2024-11-20T23:03:22Z 2023-11-15T16:00:00Z

An illustration representing Nemotron-3-8b model family.

Large language models (LLMs) are revolutionizing data science, enabling advanced capabilities in natural language understanding, AI, and machine learning. Custom LLMs, tailored for domain-specific insights, are finding increased traction in enterprise applications. The NVIDIA Nemotron-3 8B family of foundation models is a powerful new tool for building production-ready generative AI...

]]> 4 Abhishek Sawarkar <![CDATA[Elevate Enterprise Generative AI App Development with NVIDIA AI on Azure Machine Learning]]> http://www.open-lab.net/blog/?p=73312 2023-12-30T00:41:50Z 2023-11-15T16:00:00Z

Elevate Enterprise Generative AI App Development with NVIDIA AI on Azure Machine Learning

Generative AI is revolutionizing how organizations across all industries are leveraging data to increase productivity, advance personalized customer engagement, and foster innovation. Given its tremendous value, enterprises are looking for tools and expertise that help them integrate this new technology into their business operations and strategies effectively and reliably.

]]> 0 Shawn Davis <![CDATA[Generative AI and Accelerated Computing for Spear Phishing Detection]]> http://www.open-lab.net/blog/?p=70728 2023-10-05T18:18:13Z 2023-09-12T18:00:00Z

NVIDIA Spear Phishing

Spear phishing is the largest and most costly form of cyber threat, with an estimated 300,000 reported victims in 2021 representing $44 million in reported losses in the United States alone. Business e-mail compromises led to $2.4 billion in costs in 2021, according to the FBI Internet Crime Report. In the period from June 2016 to December 2021, costs related to phishing and spear phishing totaled...

]]> 0 Michelle Horton <![CDATA[Event: Speech AI Day]]> http://www.open-lab.net/blog/?p=69814 2023-08-24T19:18:11Z 2023-08-21T19:24:00Z

Speech AI Day promo asset, showing an illustration of several people from different countries standing around a globe saying hello in their language.

On Sept. 20, join experts from leading companies at NVIDIA-hosted Speech AI Day.

]]> 0 Anjali Shah <![CDATA[Mastering LLM Techniques: Customization]]> http://www.open-lab.net/blog/?p=68897 2023-12-08T18:54:22Z 2023-08-10T16:30:00Z

Decorative image.

Large language models (LLMs) are becoming an integral tool for businesses to improve their operations, customer interactions, and decision-making processes. However, off-the-shelf LLMs often fall short in meeting the specific needs of enterprises due to industry-specific terminology, domain expertise, or unique requirements. This is where custom LLMs come into play.

]]> 0 Annie Surla <![CDATA[How to Get Better Outputs from Your Large Language Model]]> http://www.open-lab.net/blog/?p=66169 2023-11-03T07:14:59Z 2023-06-14T16:18:05Z

LLM workflow demo.

Large language models (LLMs) have generated excitement worldwide due to their ability to understand and process human language at a scale that is unprecedented. It has transformed the way that we interact with technology. Having been trained on a vast corpus of text, LLMs can manipulate and generate text for a wide variety of applications without much instruction or training. However...

]]> 0 Tanay Varshney <![CDATA[An Introduction to Large Language Models: Prompt Engineering and P-Tuning]]> http://www.open-lab.net/blog/?p=63707 2023-11-28T19:18:25Z 2023-04-26T16:00:00Z

Large Language Model Basics: Prompt Engineering and P-Tuning

ChatGPT has made quite an impression. Users are excited to use the AI chatbot to ask questions, write poems, imbue a persona for interaction, act as a personal assistant, and more. Large language models (LLMs) power ChatGPT, and these models are the topic of this post. Before considering LLMs more carefully, we would first like to establish what a language model does. A language model gives...

]]> 0 Annamalai Chockalingam <![CDATA[NVIDIA Enables Trustworthy, Safe, and Secure Large Language Model Conversational Systems]]> http://www.open-lab.net/blog/?p=63745 2024-11-20T23:04:35Z 2023-04-25T13:00:00Z

NeMo Guardrails illustration.

Large language models (LLMs) are incredibly powerful and capable of answering complex questions, performing feats of creative writing, developing, debugging source code, and so much more. You can build incredibly sophisticated LLM applications by connecting them to external tools, for example reading data from a real-time source, or enabling an LLM to decide what action to take given a user's...

]]> 1 Annamalai Chockalingam <![CDATA[NVIDIA Announces Generative AI Services for Language, Visual Content, and Biology Applications]]> http://www.open-lab.net/blog/?p=62329 2024-02-15T19:14:48Z 2023-03-22T16:00:00Z

Sunset, molecule, and avatar composite.

Generative AI is primed to transform the world's industries and to solve today's most important challenges. To enable enterprises to take advantage of the possibilities with generative AI, NVIDIA has launched NVIDIA AI Foundations and the NVIDIA NeMo framework, powered by NVIDIA DGX Cloud. NVIDIA AI Foundations are a family of cloud services that provide enterprises with a simplified...

]]> 0 Abhishek Verma <![CDATA[Supercharging AI Video and AI Inference Performance with NVIDIA L4 GPUs]]> http://www.open-lab.net/blog/?p=62109 2023-10-25T23:51:25Z 2023-03-21T17:00:00Z

Picture of L4 GPU on a black background.

NVIDIA T4 was introduced 4 years ago as a universal GPU for use in mainstream servers. T4 GPUs achieved widespread adoption and are now the highest-volume NVIDIA data center GPU. T4 GPUs were deployed into use cases for AI inference, cloud gaming, video, and visual computing. At the NVIDIA GTC 2023 keynote, NVIDIA introduced several inference platforms for AI workloads...

]]> 1 Nicola Sessions <![CDATA[NVIDIA Morpheus Helps Defend Against Spear Phishing with Generative AI]]> http://www.open-lab.net/blog/?p=62189 2023-03-23T17:12:11Z 2023-03-21T16:50:53Z

Mail icon GIF

Using generative AI and the NVIDIA Morpheus cybersecurity AI framework, developers can build solutions that detect spear phishing attempts more effectively and with extremely short training times. In fact, using NVIDIA Morpheus and a generative AI training technique, we were able to detect 90% of targeted spear phishing emails, a 20% improvement compared to a typical phishing detection solution...

]]> 2 Vanessa Braunstein <![CDATA[Build Generative AI Pipelines for Drug Discovery with NVIDIA BioNeMo Service]]> http://www.open-lab.net/blog/?p=61944 2023-06-09T22:35:27Z 2023-03-21T15:50:00Z

bionemo_featured

Creating new drug candidates is a heroic endeavor, often taking over 10 years to bring a drug to market. New supercomputing-scale large language models (LLMs) that understand biology and chemistry text are helping scientists understand proteins, small molecules, DNA, and biomedical text. These state-of-the-art AI models help generate de novo proteins and molecules and predict the 3D...

]]> 1 Vinh Nguyen <![CDATA[How to Create a Custom Language Model]]> http://www.open-lab.net/blog/?p=61684 2023-06-13T17:55:25Z 2023-03-15T17:00:00Z

Abstract image

Generative AI has captured the attention and imagination of the public over the past couple of years. From a given natural language prompt, these generative models are able to generate human-quality results, from well-articulated children's stories to product prototype visualizations. Large language models (LLMs) are at the center of this revolution. LLMs are universal language comprehenders...

]]> 0 Michelle Horton <![CDATA[Top Generative AI Sessions at NVIDIA GTC 2023]]> http://www.open-lab.net/blog/?p=61120 2023-03-14T19:14:11Z 2023-02-17T19:14:12Z

Four images showcasing generative AI, with an avatar, illustration of DNA strand, a medley of zoo animals, and code.

See how recent breakthroughs in generative AI are transforming media, content creation, personalized experiences, and more.
