Numba has support for both the standard Python `random` module and the `np.random` module, which is great, since calling NumPy's random functions one value at a time from Python is generally quite slow.
I was wondering whether vanilla Python/NumPy random differs in any way from the Numba implementation.
That is, suppose (1) I have a function that generates some random numbers, and (2) another function that is an njit-ed version of (1).
Does the behavior of these functions differ in any characteristic way? Will the random numbers generated by both functions be of the same “quality”?
From memory, Numba replicates the standard NumPy results exactly: if you give Numba and NumPy the same seed as a starting state, the respective RNGs will emit the same sequence. Numba's RNG is also thread-safe, with each thread's state kept in thread-local storage. Example of parity:
from numba import njit
import numpy as np

@njit
def foo(n, seed=0):
    np.random.seed(seed)  # set the RNG state
    ret = np.empty(n)
    for i in range(n):
        ret[i] = np.random.random()
    return ret

n = 10
for x, y in zip(foo(n), foo.py_func(n)):  # run the Numba and pure-Python versions
    assert x == y
    print(x, y)
Thanks for the detailed answer. That definitely puts me at ease about using random variables in jitted functions.
I was also surprised to see that jitted random functions are so much faster than the standard numpy ones.
%timeit foo(n) #> 2.46 µs ± 155 ns per loop
%timeit foo.py_func(n) #> 9.17 µs ± 278 ns per loop
# The above function but with the following line changed
# ret[i] = np.random.randint(n)
%timeit foo(n) #> 2.4 µs ± 122 ns per loop
%timeit foo.py_func(n) #> 34 µs ± 2.09 µs per loop
About 3-4 times faster for np.random.random() and roughly 14 times faster(!) for np.random.randint() in the timings above.
That’s just too good.
In fact from now on, I’m always going to wrap numpy random functions inside an njit function.