Batched Dot Products

sschaer · October 12, 2023, 4:45pm

A batched matrix-vector product can always be written the naive way with explicit loops. Here a has shape (n, m, l) and b has shape (l,):

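import numba as nb
import numpy as np

# With parallel=False, nb.prange behaves like range; set parallel=True to run the outer loop in threads.
@nb.njit(fastmath=True, parallel=False)
def nb_multiply(a, b):
    # a: (n, m, l) batch of matrices, b: (l,) vector -> out: (n, m)
    n, m, l = a.shape
    out = np.empty((n, m))
    for i in nb.prange(n):
        for j in range(m):
            val = 0.0  # float accumulator for the dot product of a[i, j, :] and b
            for k in range(l):
                val += a[i, j, k] * b[k]
            out[i, j] = val
    return out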

This will almost always (maybe even always) perform best, because you have full control over parallelization and no additional overhead.

Here is something related that you might find interesting: Help needed to re-implement np.matmul for 4D and 5D matrix - #2 by sschaer
