Back to feed

b7632

Jan 5, 2026
Meta/llama.cppCLIvb7632

vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515)

  • vulkan: handle quantize_q8_1 overflowing the max workgroup count

  • vulkan: Fix small tile size matmul on lavapipe

  • fix mul_mat_id failures

macOS/iOS:

Linux:

Windows:

openEuler: