Luis Ceze

Luis Ceze is VP of AI Systems Software at NVIDIA, focused on AI compilers and inference technology. He received his PhD in Computer Science from UIUC and is a Professor of Computer Science and Engineering at the University of Washington. His research interests are in efficient and agile AI systems and the intersection of AI and biology. He is a Fellow of the ACM.

Posts by Luis Ceze

Development & Optimization Jun 13, 2025

Run High-Performance LLM Inference Kernels from NVIDIA Using FlashInfer??

Best-in-class LLM Inference requires two key elements: speed and developer velocity. Speed refers to maximizing the efficiency of the underlying hardware by... 6 MIN READ