CUDA - Nvprof error?

JRibeiro · January 6, 2021, 12:14am

I’m very new to nvprof and I’m trying to profile a kernel (more information can be found here GPU function apparently blocking due to data size/complexity).

I get a warning but that doesn’t seem to be the bigger issue.
The very first time I ask for something to be done towards the GPU:

spans_matrix = cuda.to_device(aux_spans_matrix)

apparently results in the error seen below. Can someone tell me if I’m doing something wrong?

==6056== Warning: Child processes are not profiled. Use option --profile-child-processes to profile them.

Traceback (most recent call last):
  File "Segmentation.py", line 958, in <module>
    spans_matrix = cuda.to_device(aux_spans_matrix)
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\devices.py", line 223, in _require_cuda_context
    with _runtime.ensure_context():
  File "C:\...\AppData\Local\Programs\Python\Python38\lib\contextlib.py", line 113, in __enter__
    return next(self.gen)
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\devices.py", line 121, in ensure_context
    with driver.get_active_context():
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\driver.py", line 393, in __enter__
    driver.cuCtxGetCurrent(byref(hctx))
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\driver.py", line 280, in __getattr__
    self.initialize()
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\driver.py", line 237, in initialize
    self.cuInit(0)
  File "D:\...\Segmentation\venv\lib\site-packages\numba\cuda\cudadrv\driver.py", line 299, in safe_cuda_api_call
    retcode = libfn(*args)
OSError: exception: access violation writing 0x0000000000000024
======== Warning: No CUDA application was profiled, exiting
======== Error: Application returned non-zero code 1

uchytilc · January 7, 2021, 10:54pm

It looks like from the stack trace the code produced an error when trying to initialize CUDA. You might want to try running nvprof as administrator if you weren’t already.

JRibeiro · January 8, 2021, 7:04pm

Thanks for the reply.
I’ve tried running as admin in both the command line and through the IDE’s terminal. The result was exactly the same though I’ve noticed it might be necessary to allow access to the GPU performance counters to all users as mentioned here:

That being said, it did work when the argument –profile-child-processes was added and it looks like it was the only issue.

Topic		Replies	Views
Numba Cuda error Support: What is this error message?	3	633	September 19, 2023
The following packages have unmet dependencies: Numba	1	502	November 14, 2022
How do I profile code including numba kernels decorated with @cuda.jit? Is there a good way to apply nsight systems and nsight compute tools that replace nvprof? Support: How do I do ...?	4	1298	May 12, 2023
Profiling Numba CUDA with nsight sys Support: How do I do ...?	1	973	July 5, 2022
Ubuntu 22.04 fresh install with conda/numba does not see GPU Numba	25	3363	April 26, 2023

CUDA - Nvprof error?

Related topics