Understanding memory usage of a kernel

Hi

I am trying to understand how to measure memory usage in Numba.

I am allocating an array of 10^9 float32 numbers, which I expect to take up about 4 GB of device memory.

However, when I try to measure the allocation, the reported usage is apparently much smaller:

MemoryInfo(free=6140526592, total=6241124352)
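
For context, here is a quick sanity check on those numbers (just arithmetic on the values above):

expected_bytes = 10**9 * 4            # 10^9 float32 values at 4 bytes each = 4 GB
used_bytes = 6241124352 - 6140526592  # total - free, roughly 100 MB actually in use

So the device reports only about 100 MB in use, nowhere near the expected 4 GB.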

Here is the code I use:

from numba import cuda
import numpy as np
import math
import time

@cuda.jit
def cuda_run(arr):
	thread = cuda.grid(1)
	if thread < arr.size:  # guard: the grid is rounded up beyond the array length
		arr[thread] = 1.0

tot_calc = 10**9
arr = np.zeros(tot_calc, dtype='float32')

print(cuda.select_device(0))

threadsperblock = 128
blockspergrid = math.ceil(tot_calc / threadsperblock)
start = time.time()
cuda_run[blockspergrid, threadsperblock](arr)
end = time.time()

print(cuda.current_context().get_memory_info())  # after the kernel call

del cuda_run  # drop the Python reference to the kernel
print(cuda.current_context().get_memory_info())

cuda.current_context().reset()  # clean up resources owned by the context
print(cuda.current_context().get_memory_info())

cuda.driver.driver.reset()  # reset the driver state
print(cuda.current_context().get_memory_info())

Is this the expected behavior? Is my expectation of a 4 GB allocation wrong, or is the way I measure it wrong?

Thanks and any help is appreciated!

OK, I think I managed to understand this myself. Apparently, when a kernel is launched with a host array, Numba transfers the array to the device behind the scenes and cleans up the device copy afterwards, so the numbers above do not reflect the allocation while the kernel is running. Explicitly transferring the array to the device with cuda.to_device gave the numbers I was expecting.
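
For anyone who hits the same thing, here is a minimal sketch of the measurement that worked, assuming a device with enough free memory for the 4 GB array (the names before, after and d_arr are just for illustration):

from numba import cuda
import numpy as np

arr = np.zeros(10**9, dtype='float32')

before = cuda.current_context().get_memory_info()
d_arr = cuda.to_device(arr)  # explicit host-to-device transfer
after = cuda.current_context().get_memory_info()

# the drop in free memory should be close to arr.nbytes (about 4 GB)
print((before.free - after.free) / 1e9, 'GB allocated on the device')

With the explicit transfer, the free memory drops by roughly arr.nbytes, which matches the expected allocation.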

Still, any confirmation here would be very appreciated! Thanks!

Your explanation sounds correct - thanks for following up with the answer!