BUG: Numba using a lot of GPU memory

Hi everyone. I think I found a bug in Numba, and I've created a reproducible demo. It should allocate only 256 KB of memory, but it allocates more than 2 GB. Can someone take a look?

```python
import numba as nb
from numba import cuda
from time import sleep

######################

@cuda.jit("void(int32[:])", device=True)
def GPU_device_function(arr):
    return

@cuda.jit("void()")
def GPU_entry_point():
    # When this if is removed, it works normally
    if cuda.grid(1):
        return

    # Should use only 256 KB of memory.
    arr = cuda.local.array(shape=65536, dtype=nb.int32)

    # When this assignment is removed, it works normally
    arr[0] = 0

    # When this call is removed, it works normally
    GPU_device_function(arr)

######################

if __name__ == '__main__':
    print(cuda.select_device(0))
    print("LOADED")

    GPU_entry_point[1, 1]()  # Run once
    cuda.synchronize()

    print("DONE")
    sleep(3)  # Wait, so the memory spike shows up in Task Manager before deallocation
    print("END")
```
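For reference, the 256 KB figure comes from 65536 `int32` elements at 4 bytes each (a quick check using NumPy's `itemsize` just to make the arithmetic explicit):

```python
import numpy as np

# One thread's local array: 65536 elements of int32 (4 bytes each)
n_elements = 65536
itemsize = np.dtype(np.int32).itemsize  # 4 bytes
total_bytes = n_elements * itemsize
print(total_bytes // 1024, "KiB")  # 256 KiB
```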

Thanks for the report! I can reproduce the behaviour, except with a Quadro RTX 8000 it uses 18GB of RAM! :slight_smile:

I’ve written it up on the issue tracker, and will report further progress there: https://github.com/numba/numba/issues/6352
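One plausible explanation (my own assumption here, not something confirmed in the issue yet): CUDA reserves local memory for every thread slot that can be resident on the device at once, not just the threads actually launched. Under that assumption the numbers line up for a Quadro RTX 8000 (72 SMs with up to 1024 resident threads each; those device figures are taken from published specs, not from the report):

```python
# Sketch: per-thread local memory scaled to all resident thread slots.
# Assumed device figures for a Quadro RTX 8000 (not from the report):
sms = 72                            # streaming multiprocessors
threads_per_sm = 1024               # max resident threads per SM (Turing)
local_bytes_per_thread = 65536 * 4  # the 256 KiB local array

total = sms * threads_per_sm * local_bytes_per_thread
print(total / 2**30, "GiB")  # 18.0 GiB -- matches the ~18 GB observed
```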