Jan 28 Invited Talk on Proteus

sklam · January 22, 2025, 4:58pm

Hello everyone,

We are excited to announce an invited talk on Jan 28, taking place during the office-hour/dev-meeting time slot. The session will be recorded for anyone who cannot attend live. Please see our calendar for instructions on how to join: Numba Dev Meetings

Proteus: JIT compilation and runtime optimization for C/C++ using LLVM

Abstract: In high-performance computing (HPC), fast application execution is the primary objective. HPC software is developed in high-performance languages (C/C++, Fortran) and is statically compiled ahead-of-time (AOT) using optimizing compilers to generate fast code. AOT compilation optimizes source code with only limited information available at compile time, but that precludes leveraging runtime information for additional optimizations.

We propose Proteus, an easy-to-use, portable, and lightweight Just-In-Time (JIT) compilation approach to optimize GPU kernels at runtime. Proteus introduces a user-friendly, annotation-based interface to dynamically extract, compile, and optimize kernels for runtime parameters, with minimal overhead by using LLVM IR and caching. Benchmark results show that Proteus significantly speeds up end-to-end execution, up to 2.8x on AMD and 1.78x on NVIDIA GPUs compared to AOT compilation. Additionally, Proteus outperforms NVIDIA’s CUDA-specific Jitify tool by an average of 1.23x, thanks to reduced overhead and faster device code in certain cases.

sklam · January 28, 2025, 8:58pm

If you missed the live talk, you can find the recording at: https://www.youtube.com/watch?v=JNT5nDlMeS8
