Flash-Attn Backward Compilation Progress | Generated by AI

Home 2026.06

Good progress:

It’s working through the backward pass kernels now. Still got a ways to go — flash-attn has ~100+ kernel variants (fwd/bwd × fp16/bf16 × head dims 64/96/128/192/256). Should be done in ~20-40 min.


Back Donate