Amit Bleiweiss – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-05-19T15:38:31Z http://www.open-lab.net/blog/feed/ Amit Bleiweiss <![CDATA[Spotlight: Qodo Innovates Efficient Code Search with NVIDIA DGX]]> http://www.open-lab.net/blog/?p=99041 2025-05-15T19:08:41Z 2025-04-23T22:23:32Z Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there...]]>

Large language models (LLMs) have enabled AI tools that help you write more code faster, but as we ask these tools to take on more and more complex tasks, there are limitations that become apparent. Challenges such as understanding the nuances of programming languages, complex dependencies, and adapting to codebase-specific context can lead to lower-quality code and cause bottlenecks down the line.

Source

]]>
Amit Bleiweiss <![CDATA[Spotlight: Drug Discovery Startup Protai Advances Complex Structure Prediction with AlphaFold, Proteomics, and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=96107 2025-04-23T02:44:24Z 2025-02-19T17:30:00Z Generative AI, especially with breakthroughs like AlphaFold and RosettaFold, is transforming drug discovery and how biotech companies and research laboratories...]]>

Generative AI, especially with breakthroughs like AlphaFold and RosettaFold, is transforming drug discovery and how biotech companies and research laboratories study protein structures, unlocking groundbreaking insights into protein interactions. Proteins are dynamic entities. It has been postulated that a protein’s native state is known by its sequence of amino acids alone…

Source

]]>
Amit Bleiweiss <![CDATA[Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=95756 2025-04-23T02:50:50Z 2025-02-05T21:30:00Z Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...]]>

Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and technical terminology handling. The emergence of sovereign AI has highlighted critical challenges in large language models (LLMs), particularly their struggle to capture nuanced cultural and linguistic contexts beyond English-dominant…

Source

]]>
1
Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Evaluation]]> http://www.open-lab.net/blog/?p=95447 2025-05-19T15:38:31Z 2025-01-29T20:44:06Z Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...]]>

Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and multifaceted nature of these systems. Unlike traditional machine learning (ML) models, LLMs generate a wide range of diverse and often unpredictable outputs, making standard evaluation metrics insufficient. Key challenges include the…

Source

]]>
Amit Bleiweiss <![CDATA[Mastering LLM Techniques: Text Data Processing]]> http://www.open-lab.net/blog/?p=91738 2025-04-01T19:02:02Z 2024-11-13T18:05:06Z Training and customizing LLMs for high accuracy is fraught with challenges, primarily due to their dependency on high-quality data. Poor data quality and...]]>

Training and customizing LLMs for high accuracy is fraught with challenges, primarily due to their dependency on high-quality data. Poor data quality and inadequate volume can significantly reduce model accuracy, making dataset preparation a critical task for AI developers. Datasets frequently contain duplicate documents, personally identifiable information (PII), and formatting issues.

Source

]]>
Amit Bleiweiss <![CDATA[Spotlight: Dataloop Accelerates Multimodal Data Preparation Pipelines for LLMs with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=91071 2024-11-14T19:07:34Z 2024-11-12T17:00:00Z In the rapidly evolving landscape of AI, the preparation of high-quality datasets for large language models (LLMs) has become a critical challenge. It directly...]]>

In the rapidly evolving landscape of AI, the preparation of high-quality datasets for large language models (LLMs) has become a critical challenge. It directly affects a model’s accuracy, performance, and ability to generate reliable and unbiased outputs across diverse tasks and domains. Thanks to the partnership between NVIDIA and Dataloop, we are addressing this obstacle head-on…

Source

]]>
Amit Bleiweiss <![CDATA[Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas]]> http://www.open-lab.net/blog/?p=89625 2024-11-07T23:29:42Z 2024-10-01T16:00:00Z In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such...]]>

In the rapidly evolving field of medicine, the integration of cutting-edge technologies is crucial for enhancing patient care and advancing research. One such innovation is retrieval-augmented generation (RAG), which is transforming how medical information is processed and used. RAG combines the capabilities of large language models (LLMs) with external knowledge retrieval…

Source

]]>
Amit Bleiweiss <![CDATA[Spotlight: xpander AI Equips NVIDIA NIM Applications with Agentic Tools]]> http://www.open-lab.net/blog/?p=88694 2024-09-19T19:31:22Z 2024-09-11T17:21:53Z Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time...]]>

Equipping agentic AI applications with tools will usher in the next phase of AI. By enabling autonomous agents and other AI applications to fetch real-time data, perform actions, and interact with external systems, developers can bridge the gap to new, real-world use cases that significantly enhance productivity and the user experience. xpander AI, a member of the NVIDIA Inception program for…

Source

]]>
Amit Bleiweiss <![CDATA[Accelerating Hebrew LLM Performance with NVIDIA TensorRT-LLM]]> http://www.open-lab.net/blog/?p=86776 2024-08-22T18:25:40Z 2024-08-06T16:03:53Z Developing a high-performing Hebrew large language model (LLM) presents distinct challenges stemming from the rich and complex nature of the Hebrew language...]]>

Developing a high-performing Hebrew large language model (LLM) presents distinct challenges stemming from the rich and complex nature of the Hebrew language itself. The intricate structure of Hebrew, with words formed through root and pattern combinations, demands sophisticated modeling approaches. Moreover, the lack of capitalization and the frequent absence of punctuation like periods and commas…

Source

]]>
1
Amit Bleiweiss <![CDATA[Enhancing RAG Pipelines with Re-Ranking]]> http://www.open-lab.net/blog/?p=86037 2024-10-28T21:56:26Z 2024-07-30T16:00:00Z In the rapidly evolving landscape of AI-driven applications, re-ranking has emerged as a pivotal technique to enhance the precision and relevance of enterprise...]]>

In the rapidly evolving landscape of AI-driven applications, re-ranking has emerged as a pivotal technique to enhance the precision and relevance of enterprise search results. By using advanced machine learning algorithms, re-ranking refines initial search outputs to better align with user intent and context, thereby significantly improving the effectiveness of semantic search.

Source

]]>
Amit Bleiweiss <![CDATA[Deploy Multilingual LLMs with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=84933 2024-07-25T18:19:11Z 2024-07-08T18:49:33Z Multilingual large language models (LLMs) are increasingly important for enterprises operating in today's globalized business landscape. As businesses expand...]]>

Multilingual large language models (LLMs) are increasingly important for enterprises operating in today’s globalized business landscape. As businesses expand their reach across borders and cultures, the ability to communicate effectively in multiple languages is crucial for success. By supporting and investing in multilingual LLMs, enterprises can break down language barriers, foster inclusivity…

Source

]]>
3
Amit Bleiweiss <![CDATA[Training Localized Multilingual LLMs with NVIDIA NeMo, Part 2]]> http://www.open-lab.net/blog/?p=82295 2025-02-17T05:27:39Z 2024-05-17T17:29:49Z In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM��s tokenizer to form a multilingual tokenizer. In this post, we...]]>

In Part 1, we discussed how to train a monolingual tokenizer and merge it with a pretrained LLM’s tokenizer to form a multilingual tokenizer. In this post, we show you how to integrate the customized tokenizer into the pretrained LLM as well as how to start a continual pretraining task in NVIDIA NeMo. Please import the following libraries before starting: After…

Source

]]>
1
Amit Bleiweiss <![CDATA[Training Localized Multilingual LLMs with NVIDIA NeMo, Part 1]]> http://www.open-lab.net/blog/?p=82294 2024-10-18T20:22:45Z 2024-05-17T17:29:13Z In today's globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs)...]]>

In today’s globalized world, the ability of AI systems to understand and communicate in diverse languages is increasingly crucial. Large language models (LLMs) have revolutionized the field of natural language processing, enabling AI to generate human-like text, answer questions, and perform various language tasks. However, most mainstream LLMs are trained on data corpora that primarily consist of…

Source

]]>
3
Amit Bleiweiss <![CDATA[Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2]]> http://www.open-lab.net/blog/?p=82196 2025-02-17T05:23:38Z 2024-05-13T17:17:38Z In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...]]>

In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo, and evaluating its performance. In this post, we walk you through curating a custom dataset and fine-tuning the model on that dataset. Custom data collection is crucial in model fine-tuning because it enables a model to adapt to…

Source

]]>
Amit Bleiweiss <![CDATA[Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1]]> http://www.open-lab.net/blog/?p=82195 2024-05-30T19:55:58Z 2024-05-13T17:15:13Z Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...]]>

Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of attention-based transformer models has had a profound impact on complicated language modeling tasks, which predict the next upcoming token in the sentence. NMT is one of the typical instances. There are plenty of open-source NMT models…

Source

]]>
Amit Bleiweiss <![CDATA[Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints]]> http://www.open-lab.net/blog/?p=81895 2025-03-11T16:19:32Z 2024-05-08T16:00:00Z Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more...]]>

Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more accurate, up-to-date, and contextually relevant responses from large language models (LLMs). By incorporating data from various sources such as relational databases, unstructured document repositories, internet data streams, and media news feeds…

Source

]]>
7
Amit Bleiweiss <![CDATA[Tune and Deploy LoRA LLMs with NVIDIA TensorRT-LLM]]> http://www.open-lab.net/blog/?p=80481 2024-09-04T22:42:54Z 2024-04-02T17:00:00Z Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent...]]>

Large language models (LLMs) have revolutionized natural language processing (NLP) with their ability to learn from massive amounts of text and generate fluent and coherent texts for various tasks and domains. However, customizing LLMs is a challenging task, often requiring a full training process that is time-consuming and computationally expensive. Moreover, training LLMs requires a diverse and…

Source

]]>
2
Amit Bleiweiss <![CDATA[Deploy an AI Coding Assistant with NVIDIA TensorRT-LLM and NVIDIA Triton]]> http://www.open-lab.net/blog/?p=77200 2024-05-07T19:14:23Z 2024-02-01T21:00:00Z Large language models (LLMs) have revolutionized the field of AI, creating entirely new ways of interacting with the digital world. While they provide a good...]]>

Large language models (LLMs) have revolutionized the field of AI, creating entirely new ways of interacting with the digital world. While they provide a good generalized solution, they often must be tuned to support specific domains and tasks. AI coding assistants, or code LLMs, have emerged as one domain to help accomplish this. By 2025, 80% of the product development lifecycle will make…

Source

]]>
0
���˳���97caoporen����