I tried profiling this example
(which Mr. Markall suggested as an answer to my previous question).
The example runs and nvidia-smi shows GPU activity, but the profile doesn't show any GPU activity at all, only a lot of CPU activity.
I am running on a GPU with compute capability 7.5, using this command:
nsys profile -w true -t cuda,nvtx,osrt,cudnn,cublas -s none -o nsight_report -f true -x true python kernel_progress.py