Synthetic data has become a standard part of large language model (LLM) post-training procedures. Using a large number of synthetically generated examples from either a single open-source, commercially permissible LLM or a cohort of them, a base LLM is finetuned with supervised finetuning or RLHF to gain instruction-following and reasoning skills. This process can be seen as a knowledge…
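As a rough illustration of this post-training flow, here is a minimal sketch of supervised finetuning on synthetic prompt/response pairs using a plain causal-LM loss; the base model, example data, and hyperparameters are placeholder assumptions rather than anything specified above.

```python
# Minimal sketch of supervised finetuning (SFT) on synthetic instruction data.
# Model name, data, and hyperparameters are illustrative assumptions.
import torch
from torch.utils.data import DataLoader
from transformers import AutoModelForCausalLM, AutoTokenizer

# Synthetic (prompt, response) pairs generated by a permissively licensed teacher LLM.
synthetic_pairs = [
    ("Summarize: The cat sat on the mat.", "A cat sat on a mat."),
    # ... thousands of generated examples in practice
]

tokenizer = AutoTokenizer.from_pretrained("gpt2")      # placeholder base model
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token              # GPT-2 has no pad token by default

def collate(batch):
    texts = [p + "\n" + r + tokenizer.eos_token for p, r in batch]
    enc = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    enc["labels"] = enc["input_ids"].clone()           # causal LM targets
    enc["labels"][enc["attention_mask"] == 0] = -100   # ignore padding in the loss
    return enc

loader = DataLoader(synthetic_pairs, batch_size=2, shuffle=True, collate_fn=collate)
optim = torch.optim.AdamW(model.parameters(), lr=2e-5)

model.train()
for batch in loader:
    loss = model(**batch).loss
    loss.backward()
    optim.step()
    optim.zero_grad()
```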
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable developers to build highly accurate LLMs, NVIDIA previously released Nemotron-CC, a 6.3-trillion-token English language Common Crawl (CC) dataset. Today, the NVIDIA NeMo Curator team is excited to share that the pipeline used to build the…
As large language models (LLMs) gain popularity in various question-answering systems, retrieval-augmented generation (RAG) pipelines have also become a focal point. RAG pipelines combine the generation power of LLMs with external data sources and retrieval mechanisms, enabling models to access domain-specific information that may not have existed during fine-tuning.
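To make the retrieve-then-generate idea concrete, here is a minimal, hedged sketch of the retrieval half of a RAG pipeline; the embedding model, toy corpus, and prompt template are illustrative assumptions, and the resulting prompt would be passed to whichever LLM serves as the generator.

```python
# Minimal sketch of a RAG step: retrieve relevant passages, then condition the
# generator on them. The retriever model, corpus, and prompt format are assumptions.
import numpy as np
from sentence_transformers import SentenceTransformer

corpus = [
    "NVIDIA NeMo Curator processes large text corpora for LLM training.",
    "RAG pipelines ground LLM answers in retrieved documents.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")   # assumed embedding model
doc_vecs = embedder.encode(corpus, normalize_embeddings=True)

def retrieve(query: str, k: int = 1) -> list[str]:
    q = embedder.encode([query], normalize_embeddings=True)[0]
    scores = doc_vecs @ q                            # cosine similarity (vectors normalized)
    return [corpus[i] for i in np.argsort(-scores)[:k]]

def build_prompt(query: str) -> str:
    context = "\n".join(retrieve(query))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}\nAnswer:"

print(build_prompt("What does a RAG pipeline do?"))
# The assembled prompt is then sent to the generator LLM of your choice.
```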
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and multifaceted nature of these systems. Unlike traditional machine learning (ML) models, LLMs generate a wide range of diverse and often unpredictable outputs, making standard evaluation metrics insufficient. Key challenges include the…
As robotics and autonomous vehicles advance, accelerating development of physical AI (which enables autonomous machines to perceive, understand, and perform complex actions in the physical world) has become essential. At the center of these systems are world foundation models (WFMs): AI models that simulate physical states through physics-aware videos, enabling machines to make accurate decisions and…
Training physical AI models used to power autonomous machines, such as robots and autonomous vehicles, requires huge amounts of data. Acquiring large sets of diverse training data can be difficult, time-consuming, and expensive. Data is often limited due to privacy restrictions or concerns, or simply may not exist for novel use cases. In addition, the available data may not apply to the full range…
Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking, waving, or picking up objects. While the concept is well-established, the challenge lies in building a robust computer vision model that can accurately recognize the range of actions across different scenarios that are domain- or use case…
Llama 3.1 Nemotron 70B Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific research, telecommunications, and sovereign AI.
This post is the third in a series on building multi-camera tracking vision AI applications. The first and second parts introduce the overall end-to-end workflow and the fine-tuning process used to enhance system accuracy. NVIDIA Metropolis is an application framework and set of developer tools that leverages AI for visual data analysis across industries. Its multi-camera tracking reference…
The Llama-3.1-Nemotron 70B-Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific research, telecommunications, and sovereign AI. This post was updated on August 16, 2024 to reflect the most recent Reward Bench results. Since the introduction and subsequent wide adoption of large language models (LLMs)…
Synthetic data isn't about creating new information. It's about transforming existing information to create different variants. For over a decade, synthetic data has been used to improve model accuracy across the board, whether it is transforming images to improve object detection models, strengthening fraudulent credit card detection, or improving BERT models for QA. What's new?
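As one concrete instance of "transforming existing information," the sketch below applies classic image augmentation to turn a single labeled image into several training variants; the specific transforms and the file name are illustrative assumptions, not taken from the post.

```python
# Minimal sketch of the "transform existing data" idea: classic image
# augmentation that turns one labeled image into many training variants.
from PIL import Image
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.ColorJitter(brightness=0.3, contrast=0.3),
    transforms.RandomRotation(degrees=10),
])

image = Image.open("part_on_conveyor.jpg")       # hypothetical input image
variants = [augment(image) for _ in range(8)]    # 8 synthetic variants, same label
```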
Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is limited. This reduces the costs and labor associated with annotating real images. Synthetic data also provides an ethical alternative to using sensitive patient data, which helps with education and training without compromising patient privacy.
As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning ideas into reality in weeks rather than months with NVIDIA Metropolis AI workflows and microservices. In this post, we explore Metropolis microservices features: Managing and automating infrastructure with AI is…
AI is rapidly changing industrial visual inspection. In a factory setting, visual inspection is used for many issues, including detecting defects and missing or incorrect parts during assembly. Computer vision can help identify problems with products early on, reducing the chances of them being delivered to customers. However, developing accurate and versatile object detection models remains…
Missed GTC or want to replay your favorite training labs? Find them on demand with the NVIDIA GTC Training Labs playlist.
We are announcing our collaboration with Intrinsic.ai on learning foundation skill models for industrial robotics tasks. Many pick-and-place problems in industrial manufacturing are still completed by human operators because programming robots for these tasks remains challenging. For instance, in a machine-tending setting, a collaborative robot could be used to pick raw material parts from a…
Learn how synthetic data is supercharging 3D simulation and computer vision workflows, from visual inspection to autonomous machines.
For robotic agents to interact with objects in their environment, they must know the position and orientation of objects around them. This information describes the six degrees of freedom (DOF) pose of a rigid body in 3D space, detailing the translational and rotational state. Accurate pose estimation is necessary to determine how to orient a robotic arm to grasp or place objects in a…
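A common way to work with a 6-DOF pose is as a 4x4 homogeneous transform combining the rotational and translational state; the sketch below, with made-up values and frame names, shows how such a pose maps a grasp point from the object frame into the sensor frame.

```python
# Minimal sketch of a 6-DOF pose as a 4x4 homogeneous transform:
# 3 translational DOF (x, y, z) plus 3 rotational DOF (roll, pitch, yaw).
# Values and frame names are illustrative.
import numpy as np
from scipy.spatial.transform import Rotation

def pose_to_matrix(xyz, rpy):
    """Build the 4x4 transform of an object expressed in the sensor frame."""
    T = np.eye(4)
    T[:3, :3] = Rotation.from_euler("xyz", rpy).as_matrix()  # rotation block
    T[:3, 3] = xyz                                           # translation column
    return T

# Object 0.5 m in front of the sensor, rotated 30 degrees about z.
T_obj = pose_to_matrix(xyz=[0.0, 0.0, 0.5], rpy=[0.0, 0.0, np.deg2rad(30)])

# A grasp point defined in the object frame, mapped into the sensor frame.
grasp_in_obj = np.array([0.1, 0.0, 0.0, 1.0])   # homogeneous coordinates
grasp_in_cam = T_obj @ grasp_in_obj
```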
Autonomous vehicles (AVs) come in all shapes and sizes, ranging from small passenger cars to multi-axle semi-trucks. However, a perception algorithm deployed on these vehicles must be trained to handle similar situations, like avoiding an obstacle or a pedestrian. The datasets used to develop and validate these algorithms are typically collected by one type of vehicle, for example, sedans…
Synthetic data can play a key role when training perception AI models that are deployed on autonomous mobile robots (AMRs). This process is becoming increasingly important in manufacturing. For an example of using synthetic data to generate a pretrained model that can detect pallets in a warehouse, see Developing a Pallet Detection Model Using OpenUSD and Synthetic Data.
Data is the lifeblood of AI systems, which rely on robust datasets to learn and make predictions or decisions. For perception AI models specifically, it is essential that data reflects real-world environments and incorporates a wide array of scenarios. This includes edge use cases for which data is often difficult to collect, such as street traffic and manufacturing assembly lines.
Sensor simulation is a critical tool to address the gaps in real-world data for autonomous vehicle (AV) development. However, it is only effective if sensor models accurately reflect the physical world. Sensors can be either passive, such as cameras, or active, sending out either an electromagnetic wave (lidar, radar) or an acoustic wave (ultrasonic) to generate the sensor output.
Smart cities are the future of urban living. Yet they can present various challenges for city planners, most notably in the realm of transportation. To be successful, various aspects of the city, from environment and infrastructure to business and education, must be functionally integrated. This can be difficult, as managing traffic flow alone is a complex problem full of challenges such as…
Imagine you are a robotics or machine learning (ML) engineer tasked with developing a model to detect pallets so that a forklift can manipulate them. You are familiar with traditional deep learning pipelines, you have curated manually annotated datasets, and you have trained successful models. You are ready for the next challenge, which comes in the form of large piles of densely stacked…
Detecting far-field objects, such as vehicles that are more than 100 m away, is fundamental for automated driving systems to maneuver safely while operating on highways. In such high-speed environments, every second counts. Thus, if the perception range of an autonomous vehicle (AV) can be increased from 100 m to 200 m while traveling at 70 mph, the vehicle has significantly more time to…
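A quick back-of-the-envelope check shows what that extra range is worth: at 70 mph (about 31.3 m/s), an additional 100 m of perception range buys roughly 3.2 seconds of additional reaction time.

```python
# Rough arithmetic for the extra reaction time gained by doubling perception range.
MPH_TO_MPS = 0.44704
speed_mps = 70 * MPH_TO_MPS                  # ~31.3 m/s

time_at_100m = 100 / speed_mps               # ~3.2 s to cover 100 m
time_at_200m = 200 / speed_mps               # ~6.4 s to cover 200 m
print(f"Extra reaction time: {time_at_200m - time_at_100m:.1f} s")  # ~3.2 s
```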
NVIDIA will present 19 research papers at SIGGRAPH, the year's most important computer graphics conference.
Most drone inspections still require a human to manually inspect the video for defects. Computer vision can help automate and accelerate this inspection process. However, training a computer vision model to automate inspection is difficult without a large pool of labeled data for every possible defect. In a recent session at NVIDIA GTC, we shared how Exelon is using synthetic data generation…
According to the American Society for Quality (ASQ), defects cost manufacturers nearly 20% of overall sales revenue. The products that we interact with on a daily basis, like phones, cars, televisions, and computers, must be manufactured with precision so that they can deliver value in varying conditions and scenarios. AI-based computer vision applications are helping to catch defects in the…
MoveIt is a robotic manipulation platform that incorporates the latest advances in motion planning, manipulation, 3D perception, kinematics, control, and navigation. PickNik Robotics, the company leading the development of MoveIt, is exploring the use of NVIDIA Isaac Sim in an internal R&D project. The project goals are to improve perception for manipulation and augment it with MoveIt Studio, PickNik's…
Training AI models requires mountains of data. Acquiring large sets of training data can be difficult, time-consuming, and expensive. Also, the data collected may not be able to cover various corner cases, preventing the AI model from accurately predicting a wide variety of scenarios. Synthetic data offers an alternative to real-world data, enabling AI researchers and engineers to bootstrap…
Get to know the NVIDIA technologies and software development tools powering the latest in robotics and edge AI.
Learn how simulation and synthetic data are transforming vision AI applications at the NVIDIA Metropolis meetup on February 22 and 23.
Discover how 3D synthetic data generation is accelerating AI and simulation workflows.
Autonomous vehicle development is all about scale. Engineers must collect and label massive amounts of data to train self-driving neural networks. This data is then used to test and validate the AV system, which is also an immense undertaking to ensure robustness. Simulation is an important tool to reach this level of scale, but accuracy is key to its effectiveness. NVIDIA DRIVE Sim…
To accelerate the development of 3D worlds and the metaverse, NVIDIA has launched numerous AI research projects to help creators across industries unlock new possibilities with generative AI. Generative AI will touch every aspect of the metaverse and it is already being leveraged for use cases like bringing AI avatars to life with Omniverse ACE. Many of these projects…
As announced at GTC, technical artists, software developers, and ML engineers can now build custom, physically accurate, synthetic data generation pipelines in the cloud with NVIDIA Omniverse Replicator. Omniverse Replicator is a highly extensible framework built on the NVIDIA Omniverse platform that enables physically accurate 3D synthetic data generation to accelerate the training and accuracy…
Synthetic data is an important tool in training machine learning models for computer vision applications. Researchers from NVIDIA have introduced a structured domain randomization system within Omniverse Replicator that can help you train and refine models using synthetic data. Omniverse Replicator is an SDK built on the NVIDIA Omniverse platform that enables you to build custom synthetic…
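Conceptually, structured domain randomization samples scene parameters from controlled distributions so every rendered frame varies while its labels stay exact. The sketch below shows that sampling step in plain Python; it is not the Omniverse Replicator API, and the parameter names and ranges are illustrative assumptions.

```python
# Conceptual sketch of structured domain randomization (not the Replicator API):
# sample scene parameters from controlled distributions so each rendered frame
# varies in lighting, pose, and clutter while ground-truth labels stay exact.
import random
from dataclasses import dataclass

@dataclass
class SceneParams:
    light_intensity: float
    object_yaw_deg: float
    num_distractors: int
    texture: str

def sample_scene() -> SceneParams:
    return SceneParams(
        light_intensity=random.uniform(300.0, 1500.0),
        object_yaw_deg=random.uniform(0.0, 360.0),
        num_distractors=random.randint(0, 10),
        texture=random.choice(["wood", "metal", "cardboard"]),
    )

# Each sampled configuration would be handed to the renderer, which writes the
# image plus ground-truth annotations (bounding boxes, segmentation, depth).
dataset_configs = [sample_scene() for _ in range(1000)]
```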
Companies providing synthetic data generation tools and services, as well as developers, can now build custom physically accurate synthetic data generation pipelines with the Omniverse Replicator SDK. Built on the NVIDIA Omniverse platform, the Omniverse Replicator SDK is available in beta within Omniverse Code. Omniverse Replicator is a highly extensible SDK built on a scalable Omniverse…
To develop an accurate computer vision AI application, you need massive amounts of high-quality data. With a traditional dataset, you might spend months collecting images, getting annotations, and cleaning data. When it's done, you could find edge cases and need more data, starting the cycle all over again. For years, this cycle has held back AI, especially in computer vision.
Data sits at the heart of model explainability. Explainable AI (XAI) is a rapidly advancing field looking to provide insights into the complex decision-making processes of AI algorithms. Where AI has a significant impact on individuals' lives, like credit risk scoring, managers and consumers alike rightfully demand insight into these decisions. Leading financial institutions are already…
The new NVIDIA NGP Instant NeRF is a great introduction to getting started with neural radiance fields. In as little as an hour, you can compile the codebase, prepare your images, and train your first NeRF. Unlike other NeRF implementations, Instant NeRF only takes a few minutes to train a great-looking visual. In my hands-on video (embedded), I walk you through the ins and outs of making…
Big data, new algorithms, and fast computation are three main factors that make the modern AI revolution possible. However, data poses many challenges for enterprises: difficulty in data labeling, ineffective data governance, limited data availability, data privacy, and so on. Synthetically generated data is a potential solution to address these challenges because it generates data points by…
From building cars to helping surgeons and delivering pizzas, robots not only automate but also speed up human tasks manyfold. With the advent of AI, you can build even smarter robots that can better perceive their surroundings and make decisions with minimal human intervention. Take, for instance, an autonomous robot used in warehouses to move payloads from one place to another.
Building AI models from scratch requires enormous amounts of data, time, money, and expertise. This is at odds with what it takes to succeed in the AI space: fast time-to-market and the ability to quickly evolve and customize solutions. NVIDIA TAO, an AI-model-adaptation framework, enables you to leverage production-quality, pretrained AI models and fine-tune them in a fraction of the time…
AI pioneer Andrew Ng is calling for a broad shift to a more data-centric approach to machine learning (ML). He recently held the first data-centric AI competition on data quality, which many claim represents 80% of the work in AI. "I'm optimistic that the AI community before long will take as much interest in systematically improving data as architecting models," Ng wrote in his newsletter…
Building training and testing playgrounds to help advance sport analytics AI solutions out of the lab and into the real world is exceedingly challenging. In team-based sports, building a correct playing strategy before the championship season is key to success for any professional coach and club owner. While coaches strive to provide the best tips and point out mistakes during the game…
Perceiving and understanding the surrounding world is a critical challenge for autonomous robots. In conjunction with ROS World 2021, NVIDIA announced its latest efforts to deliver performant perception technologies to the ROS developer community. These initiatives will accelerate product development, improve product performance, and ultimately simplify the task of incorporating cutting-edge…
Deploying an autonomous robot to a new environment can be a tough proposition. How can you gain confidence that the robot's perception capabilities are robust enough, so it performs safely and as planned? Trimble faced this challenge when it started building plans to deploy Boston Dynamics' Spot in a variety of indoor settings and construction environments. Trimble needed to tune the machine…
Simulated or synthetic data generation is an important emerging trend in the development of AI tools. Classically, these datasets can be used to address low-data problems or edge-case scenarios that might not be present in available real-world datasets. Emerging applications for synthetic data include establishing model performance levels, quantifying the domain of applicability…
The long, cumbersome slog of data procurement has been slowing down innovation in AI, especially in computer vision, which relies on labeled images and video for training. But now you can jumpstart your machine learning process by quickly generating synthetic data using AI.Reverie. With the AI.Reverie synthetic data platform, you can create the exact training data that you need in a fraction…
Data plays a crucial role in creating intelligent applications. To create an efficient AI/ML app, you must train machine learning models with high-quality, labeled datasets. Generating and labeling such data from scratch has been a critical bottleneck for enterprises. Many companies prefer a one-stop solution to support their AI/ML workflow from data generation, data labeling, model training/…
Scene graphs (SGs) in both computer vision and computer graphics are an interpretable and structural representation of scenes. A scene graph summarizes entities in the scene and plausible relationships among them. SGs have applications in the fields of computer vision, robotics, autonomous vehicles, and so on. Current SG-generation techniques rely on the limited availability of expensive…
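For intuition, a scene graph can be held in an ordinary labeled directed multigraph, with entities as nodes and subject-predicate-object triples as edges; the sketch below uses networkx and an invented scene purely for illustration.

```python
# Minimal sketch of a scene graph: entities as nodes, plausible relationships
# as labeled edges. Built with networkx for illustration; the scene is made up.
import networkx as nx

sg = nx.MultiDiGraph()

# Entities detected in the scene (node attributes hold class labels).
for obj in ["person", "cup", "table", "chair"]:
    sg.add_node(obj, category=obj)

# Subject -[predicate]-> object triples.
sg.add_edge("person", "cup", predicate="holding")
sg.add_edge("cup", "table", predicate="on")
sg.add_edge("chair", "table", predicate="next_to")

# Enumerate the triples, e.g. for a downstream reasoning or captioning step.
for subj, obj, data in sg.edges(data=True):
    print(f"({subj}, {data['predicate']}, {obj})")
```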
]]>