Backward Compilation Almost Complete | Generated by AI

Home 2026.06

Progress looks good:

Backward kernels compiled:

flash_api.o
bwd hdim128: bf16-causal, bf16, fp16-causal, fp16  ✓
bwd hdim192: bf16-causal, bf16, fp16-causal, fp16  ✓
bwd hdim256: bf16-causal, bf16, fp16-causal, fp16  ✓
bwd hdim32:  bf16-causal, bf16  ✓, fp16-causal (building), fp16 (pending)

~1 min per .o file, single-arch build. Forward pass has ~16 kernels. Estimate ~15-20 min remaining.


Back Donate