Bodo is a JIT compiler (based on Numba) and HPC-based compute engine for large-scale data processing in native Python (using Pandas/NumPy). In our benchmark (see the repo), it runs 20-240x faster than alternatives like Spark, Dask, and Ray, and it is a lot easier to use.
Bodo generates highly optimized, parallel binaries without requiring code rewrites. It eliminates runtime overheads common in driver-executor models by leveraging Message Passing Interface (MPI) for true distributed execution.
Pandas support has been a common request from Numba users, and Bodo provides it (and a lot more).
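To give a rough idea of what "no code rewrites" looks like, here is a minimal sketch: an ordinary Pandas function with the `bodo.jit` decorator on top. The function name and the toy data are made up for illustration, and the `try`/`except` fallback is only there so the snippet still runs as plain Pandas if Bodo isn't installed. How the compiled function gets launched across cores depends on the Bodo version, so check the docs for that part.

```python
import numpy as np
import pandas as pd

try:
    import bodo
    jit = bodo.jit  # Bodo's JIT decorator
except ImportError:
    # Assumption for this sketch: without Bodo installed, fall back to a
    # no-op decorator so the same function runs as ordinary Pandas.
    def jit(func):
        return func


@jit
def mean_of_group_sums(n):
    # Hypothetical toy workload: n rows spread over 4 keys.
    df = pd.DataFrame({"key": np.arange(n) % 4, "value": np.arange(n)})
    # Regular Pandas groupby/aggregate, unchanged from non-Bodo code.
    sums = df.groupby("key")["value"].sum()
    # Reduce to a single scalar.
    return sums.mean()


result = mean_of_group_sums(1000)
print(result)
```

The body of the function is the same whether or not the decorator is active, which is the point: the decorator is the only Bodo-specific line.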
Give it a spin and let us know what you think—we’d love your feedback!