How do I profile code including numba kernels decorated with @cuda.jit? Is there a good way to apply nsight systems and nsight compute tools that replace nvprof?

dstorti · April 8, 2023, 10:11pm

Does anyone have up to date advice on how to profile numba.cuda kernels? I used to be able to get some basic info using nvprof which is now replaced by nsight sytems and nsight compute. Can anyone offer advice (or better yet an example) of how to use the new tools?

Alternatively, is there something new incorporated into numba to provide kernel profiling info? Any suggestions would be appreciated.

gmarkall · April 11, 2023, 9:27am

This Deep Learning Institute workshop from the recent GTC23 explains how to use NSight Compute with Numba, including correlating Python source code with the profile info:

The Numba part starts at about 1h04 in.

dstorti · April 11, 2023, 5:04pm

Just what I was looking for. Thank you for the prompt and very helpful response. (I guess I should be looking to expand my DLI certification…)

dstorti · May 11, 2023, 12:47am

I appreciated the previous help on accessing nsight compute. Now I am trying to an analyze the impact of using multiple streams to overlap compute and data transfer, so I am really looking for something like the timelines produced by nsight systems.
Is there a good way to get that for python/numba?
Does the early part of the same DLI presentation that discuss NVTX provide a way to access such information?

gmarkall · May 12, 2023, 8:25am

I’m not sure, as I haven’t watched the earlier part of the presentation - I was aware with the later part about NSight because I provided some assistance in putting it together.

NVTX can be used from Python with the NVTX Python wrapper - perhaps this is a useful starting point? References:

GitHub - NVIDIA/NVTX: The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
nvtx · PyPI

Topic		Replies	Views
Profiling Numba CUDA with nsight sys Support: How do I do ...?	1	979	July 5, 2022
How to profile with nsight compute? Support: How do I do ...?	1	723	January 25, 2022
[ANN] Profila, a line profiler for Numba (initial release) Showcase	3	429	May 15, 2024
The following packages have unmet dependencies: Numba	1	507	November 14, 2022
[ANN] Profila v0.2, the Numba line profiler, now supports profiling Jupyter cells Showcase	2	250	May 15, 2024

How do I profile code including numba kernels decorated with @cuda.jit? Is there a good way to apply nsight systems and nsight compute tools that replace nvprof?

Related topics