Since input type of kernel function could be np.array, so we dont need to do to_device explicitly?

shouldnotfail · October 26, 2022, 5:54am

I found that kernel function also accept np.array, it is recommended to use np.array as input to reduce the lines of code or even get performance gain?

gmarkall · October 28, 2022, 3:59pm

No, the opposite - it’s recommended to use device arrays so that you don’t force implicit data transfers and synchronization with the device.

If you do pass NumPy arrays, Numba emits a warning:

NumbaPerformanceWarning: Host array used in CUDA kernel will incur copy overhead to/from device.

shouldnotfail · November 2, 2022, 4:06pm

thanks for the advice! btw the, is there any way init device array with certain value? now I have to use numpy method and then move the data to gpu

Topic		Replies	Views
Is there a way to pass list of arrays to CUDA kernel? Support: How do I do ...?	2	230	June 16, 2023
Reading a DeviceNDArray on the GPU Support: How do I do ...?	11	1208	March 16, 2021
Copy an python object to device? Support: What is this error message?	2	663	May 11, 2021
Cannot create a shared array in a kernel using kernel parameters Community Support	3	876	February 5, 2021
How to perform sorting in Numba/CUDA kernels without using local.cuda.array? Support: How do I do ...?	1	733	August 21, 2021

Since input type of kernel function could be np.array, so we dont need to do to_device explicitly?

Related Topics