Walkthrough from pure Python implementation to multi-GPU Numba-jitted version

The talk " Evaluating Your Options for Accelerated Numerical Computing in Pure Python" by my colleague Matthew Penn includes a walkthrough starting with a pure Python implementation of a k-Nearest Neighbours operand and goes through to a a multi-GPU jitted version with Numba. The trajectory it follows is:

  • The Pure Python version
  • CPU JIT with Numba
  • Parallel CPU JIT with prange
  • GPU JIT with Numba’s @cuda.jit
  • Multi-GPU JIT with Numba and Dask
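
For context, here is roughly what the pure Python starting point looks like - a brute-force k-nearest-neighbours sketch of my own, not the talk's actual notebook code:

```python
import math

def knn_pure_python(points, query, k):
    """Brute-force kNN: compute the Euclidean distance from `query`
    to every point, then return the indices of the k smallest."""
    distances = []
    for i, p in enumerate(points):
        d = 0.0
        for a, b in zip(p, query):
            d += (a - b) ** 2
        distances.append((math.sqrt(d), i))
    distances.sort()
    return [idx for _, idx in distances[:k]]

points = [[0.0, 0.0], [1.0, 1.0], [0.5, 0.2], [3.0, 3.0]]
print(knn_pure_python(points, [0.4, 0.3], k=2))  # indices of the 2 closest points
```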
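
The first acceleration step is simply decorating the hot loop with @njit, so Numba compiles it to machine code on first call. Again a minimal sketch (function names are mine):

```python
import numpy as np
from numba import njit

@njit
def pairwise_sq_distances(points, query):
    """Squared Euclidean distance from `query` to each row of `points`,
    compiled by Numba the first time it is called."""
    n, dim = points.shape
    out = np.empty(n, dtype=np.float64)
    for i in range(n):
        d = 0.0
        for j in range(dim):
            diff = points[i, j] - query[j]
            d += diff * diff
        out[i] = d
    return out

points = np.random.rand(10_000, 3)
query = np.random.rand(3)
dists = pairwise_sq_distances(points, query)
nearest = np.argsort(dists)[:5]  # indices of the 5 nearest points
```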
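
Next, parallel=True plus prange tells Numba to split the independent outer-loop iterations across CPU threads:

```python
import numpy as np
from numba import njit, prange

@njit(parallel=True)
def pairwise_sq_distances_parallel(points, query):
    """Same distance computation; prange distributes the outer loop
    across CPU threads, which is safe because iterations are independent."""
    n, dim = points.shape
    out = np.empty(n, dtype=np.float64)
    for i in prange(n):
        d = 0.0
        for j in range(dim):
            diff = points[i, j] - query[j]
            d += diff * diff
        out[i] = d
    return out
```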
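
The GPU step rewrites the loop as a @cuda.jit kernel, with one thread per point and an explicit launch configuration. A rough sketch of the pattern:

```python
import numpy as np
from numba import cuda

@cuda.jit
def sq_distance_kernel(points, query, out):
    """One GPU thread per point: each thread computes the squared
    distance from `query` to its assigned row of `points`."""
    i = cuda.grid(1)  # absolute index of this thread across the grid
    if i < points.shape[0]:
        d = 0.0
        for j in range(points.shape[1]):
            diff = points[i, j] - query[j]
            d += diff * diff
        out[i] = d

points = np.random.rand(100_000, 3)
query = np.random.rand(3)
d_points = cuda.to_device(points)          # explicit host-to-device copies
d_query = cuda.to_device(query)
d_out = cuda.device_array(points.shape[0], dtype=np.float64)

threads_per_block = 128
blocks = (points.shape[0] + threads_per_block - 1) // threads_per_block
sq_distance_kernel[blocks, threads_per_block](d_points, d_query, d_out)
dists = d_out.copy_to_host()
```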
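
Finally, the multi-GPU step combines Numba with Dask. Here is a sketch of the general pattern using dask_cuda's LocalCUDACluster (one worker process per visible GPU) - the talk's actual notebook may structure this differently:

```python
import numpy as np
import dask.array as da
from dask.distributed import Client
from dask_cuda import LocalCUDACluster
from numba import cuda

@cuda.jit
def sq_distance_kernel(points, query, out):
    # One GPU thread per point, as in the single-GPU sketch above.
    i = cuda.grid(1)
    if i < points.shape[0]:
        d = 0.0
        for j in range(points.shape[1]):
            diff = points[i, j] - query[j]
            d += diff * diff
        out[i] = d

def gpu_sq_distances(chunk, query):
    # Runs on whichever worker (and therefore GPU) Dask assigns this chunk to.
    d_chunk = cuda.to_device(np.ascontiguousarray(chunk))
    d_query = cuda.to_device(query)
    d_out = cuda.device_array(chunk.shape[0], dtype=np.float64)
    tpb = 128
    blocks = (chunk.shape[0] + tpb - 1) // tpb
    sq_distance_kernel[blocks, tpb](d_chunk, d_query, d_out)
    return d_out.copy_to_host()

if __name__ == "__main__":
    cluster = LocalCUDACluster()  # starts one Dask worker per visible GPU
    client = Client(cluster)

    # Chunks of the Dask array are scattered across the workers/GPUs.
    points = da.random.random((1_000_000, 3), chunks=(250_000, 3))
    query = np.random.rand(3)
    dists = points.map_blocks(
        gpu_sq_distances, query, drop_axis=1, dtype=np.float64
    ).compute()
```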

It also includes the use of an External Memory Manager (RMM, the RAPIDS Memory Manager) with Numba, and explains some optimization strategies for the GPU kernels.
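
For illustration, plugging RMM into Numba looks roughly like this, assuming a recent RMM release that ships the Numba allocator under rmm.allocators.numba (older releases expose it differently):

```python
from numba import cuda
import rmm
from rmm.allocators.numba import RMMNumbaManager

# Serve all of Numba's device allocations from an RMM memory pool,
# avoiding repeated cudaMalloc/cudaFree calls. The memory manager
# must be set before the CUDA context is initialized.
rmm.reinitialize(pool_allocator=True)
cuda.set_memory_manager(RMMNumbaManager)

d_arr = cuda.device_array(1_000_000)  # now allocated from the RMM pool
```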

I think this could be helpful for anyone looking to understand how to port pure Python code all the way up to multi-GPU high-performance code, because it follows the typical steps one needs to take to get there: Python → CPU JIT → GPU JIT → Parallel/multi-GPU.

The talk also discusses some other options (CuPy, NumPy, etc.), but a large portion of it focuses on Numba, so I felt it was worthwhile sharing here. Note that although Matthew is a colleague of mine, I didn’t have any involvement in the preparation of the talk - it is all his work :slight_smile:.

Recording: Attendee Portal (free NVIDIA GTC registration required)
Slides: https://static.rainfocus.com/nvidia/gtcspring2022/sess/1638480642908001OycX/SessionFile/Evaluating%20Your%20Options%20for%20Accelerated%20Numerical%20Computing%20in%20Pure%20Python_1647528023707001MkTJ.pdf (not sure if registration is required here)
Example code (on GitHub, accessible without restriction): https://github.com/rapidsai-community/event-notebooks/tree/main/GTC_Spring_2022/numerical-computing