Sean Sodha – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-06-26T18:54:02Z http://www.open-lab.net/blog/feed/ Sean Sodha <![CDATA[Run Multimodal Extraction for More Efficient AI Pipelines Using One GPU]]> http://www.open-lab.net/blog/?p=102474 2025-06-26T18:54:02Z 2025-06-18T20:31:51Z As enterprises generate and consume increasing volumes of diverse data, extracting insights from multimodal documents, like PDFs and presentations, has become a...]]>

As enterprises generate and consume increasing volumes of diverse data, extracting insights from multimodal documents, like PDFs and presentations, has become a major challenge. Traditional text-only extraction and basic retrieval-augmented generation (RAG) pipelines fall short, failing to capture the full value of these complex documents. The result? Missed insights, inefficient workflows…

Source

]]>
Sean Sodha <![CDATA[NVIDIA NeMo Retriever Delivers Accurate Multimodal PDF Data Extraction 15x Faster]]> http://www.open-lab.net/blog/?p=97161 2025-04-23T00:13:16Z 2025-03-18T19:20:51Z Enterprises are generating and storing more multimodal data than ever before, yet traditional retrieval systems remain largely text-focused. While they can...]]>

Enterprises are generating and storing more multimodal data than ever before, yet traditional retrieval systems remain largely text-focused. While they can surface insights from written content, they aren’t extracting critical information embedded in tables, charts, and infographics—often the most information-dense elements of a document. Without a multimodal retrieval system…

Source

]]>
Sean Sodha <![CDATA[Build an Enterprise-Scale Multimodal PDF Data Extraction Pipeline with an NVIDIA AI Blueprint]]> http://www.open-lab.net/blog/?p=87948 2024-11-14T04:04:51Z 2024-08-28T15:00:00Z Trillions of PDF files are generated every year, each file likely consisting of multiple pages filled with various content types, including text, images,...]]>

Trillions of PDF files are generated every year, each file likely consisting of multiple pages filled with various content types, including text, images, charts, and tables. This goldmine of data can only be used as quickly as humans can read and understand it. But with generative AI and retrieval-augmented generation (RAG), this untapped data can be used to uncover business insights that…

Source

]]>
���˳���97caoporen����