Profiling Numba CUDA with nsight sys

Tried profiling this example:

(which Mr. Markall suggested as an answer to my previous question)

The example runs, nvidia-smi shows GPU activity, but profiling doesn’t show the GPU activity at all only much CPU activity.
Any ideas?

I run on a 7.5 compute capability GPU with this command:

nsys profile -w true -t cuda,nvtx,osrt,cudnn,cublas -s none -o nsight_report -f true -x true python kernel_progress.py

nsys is NSIght Systems, the whole-system profiler. You need to use NSight Compute (ncu) for profiling kernels. See: Nsight Compute CLI :: Nsight Compute Documentation.