Francesco Ciannella – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-07-24T18:32:30Z http://www.open-lab.net/blog/feed/ Francesco Ciannella <![CDATA[Enhancing Multilingual Human-Like Speech and Voice Cloning with NVIDIA Riva TTS]]> http://www.open-lab.net/blog/?p=102982 2025-07-24T18:32:30Z 2025-07-14T16:30:00Z While speech AI is used to build digital assistants and voice agents, its impact extends far beyond these applications. Core technologies like text-to-speech...]]>

While speech AI is used to build digital assistants and voice agents, its impact extends far beyond these applications. Core technologies like text-to-speech (TTS) and automatic speech recognition (ASR) are driving innovation across industries. They’re enabling real-time translation, powering interactive digital humans, and even helping restore speech for individuals who’ve lost their voices.

Source

]]>
Francesco Ciannella <![CDATA[Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=96151 2025-03-06T19:26:45Z 2025-02-26T17:00:00Z In today��s data-driven world, the ability to retrieve accurate information from even modest amounts of data is vital for developers seeking streamlined,...]]>

In today’s data-driven world, the ability to retrieve accurate information from even modest amounts of data is vital for developers seeking streamlined, effective solutions for quick deployments, prototyping, or experimentation. One of the key challenges in information retrieval is managing the diverse modalities in unstructured datasets, including text, PDFs, images, tables, audio, video…

Source

]]>
1
���˳���97caoporen����