A huge performance penalty for this simple function

Somewhat related to the unpredictability in performance variation - I posted this question a while ago about parallel performance.
https://numba.discourse.group/t/floyd-warshalls-in-numba-vs-c-openmp/1719

The variation in just moving the minimum into a separate function was very noticeable, not sure why.