Tutorial on supporting Python User-Defined Functions in CUDA-accelerated Applications with Numba

gmarkall · March 25, 2022, 5:31pm

My GTC 2022 talk explains how to provide an API for User-Defined Functions in Python that get compiled into CUDA code using Numba. A lot of the talk is focused more generically on writing Numba extensions (as this is the vehicle for supporting an application’s data structures), so it may be of interest to those wishing to learn more about Numba’s internals, or extending Numba.

Abstract: “Many applications provide a Python API so that users can script their execution and extend their functionality — well-known examples include Blender, FreeCAD, and QGIS. Accelerated applications can also provide Python APIs; although this provides extra power and flexibility to the end user, these APIs are typically restricted to plumbing together calls to preexisting kernels provided by the application developer — it’s generally not possible for users to write their own CUDA kernels for the application in Python. Existing solutions to this problem entail writing CUDA C kernels — however, Python programmer productivity falls drastically when they need to write kernels in another language. Numba is a compiler that enables users to write their own CUDA kernels in Python. Learn how to integrate and extend Numba within an accelerated application so that users can implement high-performance extensions and workflows as user-defined functions within the accelerated application using only Python.”

Recording: https://events.rainfocus.com/widget/nvidia/gtcspring2022/sessioncatalog/session/16339878397050012ADx (note that registration for NVIDIA’s GPU Technology Conference (GTC) is required, which is free)

Slides: https://raw.githubusercontent.com/gmarkall/numba-accelerated-udfs/main/slides.pdf

Example repository: GitHub - gmarkall/numba-accelerated-udfs: Enabling Python User-Defined Functions in Accelerated Applications with Numba

Docker: Running the example code: docker run -p 8888:8888 gmarkall/filigree:v1 and open the “Filigree Demo” notebook.

Topic		Replies	Views
Numba for CUDA Programmers course released Announcements	0	786	April 23, 2021
Feb 25 Invited Talk: Faster and simpler Numba CUDA kernels using CUB and cuda.cooperative Announcements	1	249	February 28, 2025
Kernel within a kernel Support: How do I do ...?	10	1320	August 1, 2024
Cuda vs CPU maintenance Community Support	1	508	June 15, 2020
Walkthrough from pure Python implementation to multi-GPU Numba-jitted version Showcase	0	1456	March 25, 2022

Tutorial on supporting Python User-Defined Functions in CUDA-accelerated Applications with Numba

Related topics