Blurhash-numba project feedback

animator · September 25, 2020, 2:37am

Hi Numba community,

I have just released the first version of a blurhash implementation in Numba. You can find the project here - https://github.com/animator/blurhash-numba

It would be great if you could share some feedback (any numba-jitsu is welcome) so that I can improve its performance even further.

Thanks & Regards,
Ankit

sklam · September 25, 2020, 7:52pm

That’s a cool project. I noticed one thing when I glanced through the code. When an array type is spelled like this, nb.float64[:, :, :] (from here), some optimizations might be missed because:

>>> import numba as nb
>>> nb.float64[:, :, :]
array(float64, 3d, A)

which is an array type of unknown memory layout (“A” means any layout).

If you use the following instead:

>>> nb.float64[:, :, ::1]
array(float64, 3d, C)

The compiler knows that it is C-contiguous and can emit better loops.

To make it generic, I usually just let the type inference do the work to provide the most precise type info for each argument.

animator · September 25, 2020, 11:47pm

Thank you @sklam for this awesome feedback.
The whole reason I went ahead with eager compilation instead of lazy compilation (letting type inference do the work) is that I wanted to reduce the runtime of the first run. I have yet to run tests on a serverless framework (like GCP) to see whether eager vs. lazy compilation matters. So I implemented it on theoretical grounds, as I am not aware of how these frameworks cache the compiled code.
Any views on my assumptions will be highly valued.
Thank you again for your feedback.

Regards,
Ankit

sklam · October 1, 2020, 4:58pm

I’m not familiar with the behavior of GCP, but eager compilation transfers the compilation cost to import time. If your script runs in a fresh process each time, the compilation overhead will be the same either way. You might want to explore caching the compilation result, e.g. @jit(cache=True).

animator · October 3, 2020, 1:29am

Sure @sklam,

In my past experience, GCP needs a certain cold-start time to set up the environment before the API (function) is ready to use. The API also goes cold if it is not used for a certain number of minutes, and it gets restarted from time to time. I will have to investigate whether it sets up a new instance for every cold start, as that would render caching ineffective. For now, I went ahead with eager compilation to transfer the compilation cost to the cold-start time. I will investigate further and see whether I can eliminate this cost altogether.

Thank you for your inputs.

Regards,
Ankit
