c++ - Speed up float 5x5 matrix * vector multiplication with SSE

Question

Welcome To Ask or Share your Answers For Others

c++ - Speed up float 5x5 matrix * vector multiplication with SSE

posted Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

c++ - Speed up float 5x5 matrix * vector multiplication with SSE

I need to run a matrix-vector multiplication 240000 times per second. The matrix is 5x5 and is always the same, whereas the vector changes at each iteration. The data type is float. I was thinking of using some SSE (or similar) instructions.

I am concerned that the number of arithmetic operations is too small compared to the number of memory operations involved. Do you think I can get some tangible (e.g. > 20%) improvement?
Do I need the Intel compiler to do it?
Can you point out some references?

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-23T18:58:54+0000

The Eigen C++ template library for vectors, matrices, ... has both

optimised code for small fixed size matrices (as well as dynamically sized ones)
optimised code that uses SSE optimisations

so you should give it a try.

Categories

c++ - Speed up float 5x5 matrix * vector multiplication with SSE

c++ - Speed up float 5x5 matrix * vector multiplication with SSE

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags