Nested numba functions slow down execution and cannot be inlined

aseyboldt · February 6, 2023, 4:32pm

I just ran that example on the branch with only global optimization mentioned here, but it seems that only has a minor impact:

%timeit fct_1_nested(s1, s2)
%timeit fct_1(s1, s2)

# Default optimizations in numba
2.52 ms ± 11.4 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
116 µs ± 376 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

# merged-compile branch
2.32 ms ± 18.2 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
120 µs ± 925 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

The merged compilation is a little bit faster, but it doesn’t look like it is the main reason for the performance difference.

Topic		Replies	Views
Why this function call is faster than inlined version? Numba	3	313	May 15, 2023
Numba with multiprocessing Support: How do I do ...?	1	3491	January 10, 2022
3x slowdown in parent function when applying njit Community Support	11	517	July 16, 2020
Numba lennard-jones example is better with function nesting? Community Support	3	211	October 10, 2023
Passing namedTuple to a jitted function is slow Community Support	1	604	April 25, 2022

Nested numba functions slow down execution and cannot be inlined

Related Topics