Numba is the just-in-time compiler used in RAPIDS cuDF to implement high-performance user-defined functions (UDFs) by turning user-supplied Python functions into CUDA kernels. But how does it go from Python code to CUDA kernel? In this post, I discuss Numba��s compilation pipeline. If you enjoy diving into Numba��s internals, see the accompanying notebook that shows each stage in more depth��
]]>