RAPIDS

Jun 05, 2025
Supercharge Tree-Based Model Inference with Forest Inference Library in NVIDIA cuML
Tree-ensemble models remain a go-to for tabular data because they're accurate, comparatively inexpensive to train, and fast. But deploying Python inference on...
11 MIN READ

Jun 02, 2025
Supercharging Fraud Detection in Financial Services with Graph Neural Networks (Updated)
Note: This blog post was originally published on Oct. 28, 2024, but has been edited to reflect new updates. Fraud in financial services is a massive problem....
10 MIN READ

May 29, 2025
RAPIDS Brings Zero-Code-Change Acceleration, IO Performance Gains, and Out-of-Core XGBoost
Over the past two releases, RAPIDS introduced zero-code-change acceleration for Python machine learning, huge IO performance improvements, larger-than-memory...
10 MIN READ

May 22, 2025
Grandmaster Pro Tip: Winning First Place in a Kaggle Competition with Stacking Using cuML
What does it take to win a Kaggle competition in 2025? In the April Playground challenge, the goal was to predict how long users would listen to a podcast—and...
7 MIN READ

May 19, 2025
Spotlight: Atgenomix SeqsLab Scales Health Omics Analysis for Precision Medicine
In traditional clinical medical practice, treatment decisions are often based on general guidelines, past experiences, and trial-and-error approaches. Today,...
9 MIN READ

May 15, 2025
Simplify Setup and Boost Data Science in the Cloud using NVIDIA CUDA-X and Coiled
Imagine analyzing millions of NYC ride-share journeys—tracking patterns across boroughs, comparing service pricing, or identifying profitable pickup...
10 MIN READ

May 15, 2025
Predicting Performance on Apache Spark with GPUs
The world of big data analytics is constantly seeking ways to accelerate processing and reduce infrastructure costs. Apache Spark has become a leading platform...
9 MIN READ

May 08, 2025
Accelerate Deep Learning and LLM Inference with Apache Spark in the Cloud
Apache Spark is an industry-leading platform for big data processing and analytics. With the increasing prevalence of unstructured data—documents, emails,...
10 MIN READ

May 07, 2025
Building Nemotron-CC, A High-Quality Trillion Token Dataset for LLM Pretraining from Common Crawl Using NVIDIA NeMo Curator
Curating high-quality pretraining datasets is critical for enterprise developers aiming to train state-of-the-art large language models (LLMs). To enable...
7 MIN READ

May 01, 2025
Stacking Generalization with HPO: Maximize Accuracy in 15 Minutes with NVIDIA cuML
Stacking generalization is a widely used technique among machine learning (ML) engineers, where multiple models are combined to boost overall predictive...
7 MIN READ

Apr 10, 2025
Efficiently Scaling Polars GPU Parquet Reader
When working with large datasets, the performance of your data processing tools becomes critical. Polars, an open-source library for data manipulation known for...
4 MIN READ

Apr 03, 2025
Accelerating Apache Parquet Scans on Apache Spark with GPUs
As data sizes have grown in enterprises across industries, Apache Parquet has become a prominent format for storing data. Apache Parquet is a columnar storage...
8 MIN READ

Mar 11, 2025
Efficient ETL with Polars and Apache Spark on NVIDIA Grace CPU
The NVIDIA Grace CPU Superchip delivers outstanding performance and best-in-class energy efficiency for CPU workloads in the data center and in the cloud. The...
7 MIN READ

Mar 06, 2025
Accelerate Apache Spark ML on NVIDIA GPUs with Zero Code Change
The NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It...
5 MIN READ

Mar 04, 2025
GPU-Accelerate Algorithmic Trading Simulations by over 100x with Numba
Quantitative developers need to run back-testing simulations to see how financial algorithms perform from a profit and loss (P&L) standpoint. Statistical...
12 MIN READ

Feb 27, 2025
High-Performance Remote IO With NVIDIA KvikIO
Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
9 MIN READ