Back to feed

b7950

Feb 5, 2026
Meta/llama.cppCLIvb7950

vulkan: Preprocess FA mask to detect all-neg-inf and all-zero. (#19281)

Write out a 2-bit code per block and avoid loading the mask when it matches these two common cases.

Apply this optimization when the mask is relatively large (i.e. prompt processing).

macOS/iOS:

Linux:

Windows:

openEuler: