Dynamically adjust size of cuda.local.array without environment variables

gmarkall · October 18, 2021, 9:41am

Are you looking for something that varies the size of a local array in a kernel? e.g.:

from numba import cuda, types


def gen_kernel(local_size):
    @cuda.jit
    def f():
        x = cuda.local.array(local_size, types.int32)
        print("Length of local array is", len(x))

    return f


# Generate kernels with different sized local arrays and launch them with a
# single thread
gen_kernel(2)[1, 1]()
gen_kernel(3)[1, 1]()

# Synchronize to make sure we don't exit before the output can be printed
cuda.synchronize()

which outputs:

Length of local array is 2
Length of local array is 3

Topic		Replies	Views
Unable to create an empty array inside the device function Support: How do I do ...?	4	862	October 16, 2022
How do I use global memory with numba.cuda? Support: How do I do ...?	10	1913	August 4, 2021
How to perform sorting in Numba/CUDA kernels without using local.cuda.array? Support: How do I do ...?	1	735	August 21, 2021
Cannot create a shared array in a kernel using kernel parameters Community Support	3	876	February 5, 2021
CUDA Python - Multiple Threads Operating on Same Array Location Support: How do I do ...?	6	1308	January 21, 2022

Dynamically adjust size of cuda.local.array without environment variables

Related Topics