Back to feed

b7841

Jan 27, 2026
Meta/llama.cppCLIvb7841

opencl: add flattened q6_K mv (#19054)

  • opencl: flatten q6_K and add kernel_mul_mv_q6_K_f32_flat

  • opencl: clean up

  • opencl: refactor q6_K mv - put loop body in block_q_6_K_dot_y_flat

  • opencl: tweak the workgroup size a bit

  • opencl: output 4 values per subgroup for kernel_mul_mv_q6_K_f32_flat

  • opencl: proper alignment for q6_K

  • opencl: boundary handling for flattened q6_K mv

  • opencl: rename q6_K mv kernel file

  • opencl: put flattened q6_K mv in its own file

  • opencl: use lower k in file name

  • opencl: use K in variable names

macOS/iOS:

Linux:

Windows:

openEuler: