Bita Darvish Rouhani

Bita Darvish Rouhani is a distinguished engineer and manager at NVIDIA, leading algorithms, software, and hardware co-design initiatives for cost-optimized generative AI inference. Prior to joining Nvidia, Bita was a partner group manager at Microsoft, where she co-founded and led the OCP MX consortium. This consortium has standardized the first set of 4- and 6-bit data types for AI training and inference for nearly all mainstream AI chips. Bita holds a Ph.D. in computer engineering from UC San Diego.

Posts by Bita Darvish Rouhani

Development & Optimization Jul 07, 2025

Think Smart and Ask an Encyclopedia-Sized Question: Multi-Million Token Real-Time Inference for 32X More Users

Modern AI applications increasingly rely on models that combine huge parameter counts with multi-million-token context windows. Whether it is AI agents... 8 MIN READ

Data Center / Cloud Jun 06, 2025

How NVIDIA GB200 NVL72 and NVIDIA Dynamo Boost Inference Performance for MoE Models

The latest wave of open source large language models (LLMs), like DeepSeek R1, Llama 4, and Qwen3, have embraced Mixture of Experts (MoE) architectures. Unlike... 12 MIN READ