Optimizing a Rust GPU matmul kernel
I read the excellent post [Optimizing a WebGPU Matmul Kernel for 1TFLOP+... (more…)
Read more »
If you’re looking to write fast code in Rust, good news! Rust makes it really
easy to write really fast code. The focus on zero-cost abstractions, the
lack of implicit boxing and the static memory management means that even naïve
code is often faster than… Read more