How is `len` implemented?

tornikeo · April 4, 2025, 12:21pm

This, amazingly, works:

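from numba import cuda
import numpy as np

@cuda.jit
def kernel(arr):
    idx = cuda.grid(1)  # global thread index (unused here)
    print(len(arr))     # every launched thread prints the array length

n = 100
arr = cuda.to_device(np.arange(n))
threadsperblock = 32
blockspergrid = (n + threadsperblock - 1) // threadsperblock  # ceiling division
kernel[blockspergrid, threadsperblock](arr)
cuda.synchronize()  # wait for the kernel so its printed output is flushed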

This prints 100 from every thread. How is this possible? Internally, how is the shape of the array `arr` passed into the kernel?
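
To make the question concrete, here is a rough mental model, not Numba's actual code. It assumes the kernel receives a small descriptor struct rather than a bare pointer, with the shape and strides stored next to the data pointer, so `len` only has to read the first shape entry. The names `ArrayDescriptor`, `make_descriptor` and `kernel_len`, and the field layout, are hypothetical and only serve the illustration.

```python
import ctypes


class ArrayDescriptor(ctypes.Structure):
    """Hypothetical 1-D array descriptor: data pointer plus shape metadata."""
    _fields_ = [
        ("data", ctypes.c_void_p),        # address of the first element
        ("nitems", ctypes.c_int64),       # total number of elements
        ("itemsize", ctypes.c_int64),     # bytes per element
        ("shape", ctypes.c_int64 * 1),    # one extent per dimension
        ("strides", ctypes.c_int64 * 1),  # byte step per dimension
    ]


def make_descriptor(values):
    """Build the descriptor on the 'host' side from a ctypes array."""
    itemsize = ctypes.sizeof(values._type_)
    n = len(values)
    return ArrayDescriptor(
        data=ctypes.addressof(values),
        nitems=n,
        itemsize=itemsize,
        shape=(ctypes.c_int64 * 1)(n),
        strides=(ctypes.c_int64 * 1)(itemsize),
    )


def kernel_len(desc):
    """What len(arr) would reduce to under this assumption: read shape[0]."""
    return desc.shape[0]


values = (ctypes.c_int64 * 100)(*range(100))
desc = make_descriptor(values)
print(kernel_len(desc))
```

If that assumption holds, the length travels with the kernel arguments at launch, and no extra lookup is needed on the device.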
