Kyle Kranen – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2025-04-23T00:15:55Z http://www.open-lab.net/blog/feed/ Kyle Kranen <![CDATA[Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models]]> http://www.open-lab.net/blog/?p=95274 2025-04-23T00:15:55Z 2025-03-18T17:50:00Z

NVIDIA announced the release of NVIDIA Dynamo today at GTC 2025. NVIDIA Dynamo is a high-throughput, low-latency open-source inference serving framework for deploying generative AI and reasoning models in large-scale distributed environments. The framework boosts the number of requests served by up to 30x when running the open-source DeepSeek-R1 models on NVIDIA Blackwell.

]]> 2 Kyle Kranen <![CDATA[Applying Mixture of Experts in LLM Architectures]]> http://www.open-lab.net/blog/?p=79605 2024-06-06T14:53:24Z 2024-03-14T20:01:00Z

Mixture of experts (MoE) large language model (LLM) architectures have recently emerged, both in proprietary LLMs such as GPT-4 and in community models, with the open-source release of Mistral Mixtral 8x7B. The strong relative performance of the Mixtral model has raised much interest and numerous questions about MoE and its use in LLM architectures. So, what is MoE and why is it important?

]]> Kyle Kranen <![CDATA[Available Now: NVIDIA AI Accelerated DGL and PyG Containers for GNNs]]> http://www.open-lab.net/blog/?p=74698 2023-12-14T19:27:28Z 2023-12-08T22:07:12Z

From credit card transactions, social networks, and recommendation systems to transportation networks and protein-protein interactions in biology, graphs are the go-to data structure for modeling and analyzing intricate connections. Graph neural networks (GNNs), with their ability to learn and reason over graph-structured data, have emerged as a game-changer across various domains. However…

]]> 0 Kyle Kranen <![CDATA[Optimizing Fraud Detection in Financial Services with Graph Neural Networks and NVIDIA GPUs]]> http://www.open-lab.net/blog/?p=55557 2023-12-05T18:55:13Z 2022-10-04T13:00:00Z

Fraud is a major problem for many financial services firms, costing billions of dollars each year, according to a recent Federal Trade Commission report. Financial fraud, fake reviews, bot attacks, account takeovers, and spam are all examples of online fraud and harmful activity. Although these firms employ techniques to combat online fraud, the methods can have severe limitations.

]]> 7 Kyle Kranen <![CDATA[Time Series Forecasting with the NVIDIA Time Series Prediction Platform and Triton Inference Server]]> http://www.open-lab.net/blog/?p=44168 2022-08-21T23:53:25Z 2022-02-15T16:00:00Z

In this post, we detail the recently released NVIDIA Time Series Prediction Platform (TSPP), a tool designed to easily compare and experiment with arbitrary combinations of forecasting models, time-series datasets, and other configurations. The TSPP also provides functionality to explore the hyperparameter search space, run accelerated model training using distributed training and Automatic Mixed…

]]> 3 Kyle Kranen <![CDATA[Accelerating the Wide & Deep Model Workflow from 25 Hours to 10 Minutes Using NVIDIA GPUs]]> http://www.open-lab.net/blog/?p=29663 2024-10-28T19:02:41Z 2021-04-29T22:15:38Z

Recommender systems drive engagement on many of the most popular online platforms. As data volume grows exponentially, data scientists increasingly turn from traditional machine learning methods to highly expressive, deep learning models to improve recommendation quality. Often, the recommendations are framed as modeling the completion of a user-item matrix, in which the user-item entry is the…

]]> 1